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ABSTRACT 

In this paper, the second in a series of two, we justify two important assumptions on 
which the result is based that in course of a galaxy merger the slingshot ejection of 
bound stars is sufficiently efficient to allow a supermassive black hole binary to merge. 
A steep cusp with a power law index of 2.5-3 is required which is as massive as the 
binary and surrounds the BHs when the binary becomes hard. This cusp is probably 
formed when both clusters, surrounding each black hole, merge and combine with the 
matter funneled into the center. We find this profile to be in agreement with observed 
post-merger distributions after the cusp has been destroyed. The time dependency 
we derive for the merger predicts that stalled black holes, if they exist at all, will 
preferably be found in less than ~ 0.2 pc distance. To test this prediction we compute 
the current semimajor axis of 12 candidates of ongoing mergers. We find all binaries 
unambiguously to be already in the last phase when they decay due to the emission of 
gravitational waves. Therefore, in striking contradiction with predictions of a depleted 
loss-cone, the abscence of even a single source in the slingshot phase strongly supports 
our previous and current results: Binaries merge due to slingshot ejection of stars 
which have been funneled into the central regions in course of a galaxy collision. 

Key words: black hole physics - galaxies: evolution - galaxies: interaction - galaxies: 
kinematics and dynamics 



1 INTRODUCTION 

The formation of a supermassive black hole binary (BHB) 
is the natural consequence of two widely accepted assump- 
tions: Galaxies harbour a supermassive black hole (BH) in 
their center and galaxies merge with each other. Such BHBs, 
merged and not yet merged, are important because they 
are used to explain a wide variety of features observed in 
galaxies. For a deta i led re view on observational e vidence of 
BHBs see iKomossal (j2003l ) or an updated version, IKomossal 
(2006). The evolution of the mergi ng BHs can be subd i- 
vided into three successive phases l|Begelman et al.H l980): 
In the beginning both cores spiral inwards to their com- 
mon center due to dynamical friction. Once the BHs bind 
to each other on the parsec scale and form a hard binary 
they keep on merging due to slingshot ejection of stars. Fi- 
nally, in the third phase, the binary continues to decay owing 
to the emission of gravitational waves. While the first and 
third phase are well investigated it is still a matter of de- 
bate whether the slingshot ejection of stars in the second 
phase is efficient enough to enable the binary to enter the 
final phase or whether the merging process comes to a halt 
due to loss-cone depletion. Even though numerical scatter- 
ing experiments showed that the BHs merge on scales of 
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10 8 ~%r (lQuinlanlll996l: lMakinolll997l: iQuinlan fc Hernqmstl 
ll997l : lMilosavlievic fc Merrittll200ll : IZier fc Biermand^OOll ) 
it is argued in all publications but the last that the loss-cone 
becomes depleted long before the binary enters the third 
phase and the binary probably gets stalled. This reasoning 
is based on the assumption that the binary is embedded in 
a flat spherically symmetric core which is der ived from the 
centr al density profiles of elliptical galaxies (|Berczik et alj 
2005). According to hierarchical models for galaxy formation 
this type of galaxy has experienced a major merger previ- 
ously and therefore its mass has been redistributed from the 
central parts to the outer regions, resulting in a flat profile 
after the merger. 

While there is no conclusive observational evidence for 
stalled binaries various so urces suggest a general l y suc - 
cessful merger of the BHs. lHaehnelt fc Kauffmannl |2002) 
argued that if the merging time would exceed a Hubble 
time the binary should become ejected in about 40% of 
bright ellipticals when merging with a third galaxy. They 
pointed out that this would be in contradiction with the 
BHs which have been observed indirectly in all nearby el- 
liptical galaxi es and with the small scatter of the Mbh - ff * 
relation (e.g. iGebhardt et~ai1 l2000l : iTremaine et ail |2002| ). 
A certain class of sources, the so-called X-shaped ra- 
dio galaxies (XRGs), can be well explained in terms of 
a completed merger of a BHB, an interpretation first 
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used by iRottmarml (|200lh ■ When the BHs finally coa- 
lesce the spin axis is rapidly realigned into the direction 
of the orbital angular momentum so t hat the old and 
new l obes appear as an X on the sky llZier fc Biermannl 
200 ll; iDennett-Thorpe et all |2002| ; IZier fc Biermannl l2002t 
Gereelv fc Biermannl l2007h . During the third phase the 



rapidly precessing jet produces effectively a powerful wind, 
which entrains the e n viron mental gas and is identified by 
Gereelv fc Biermannl (12007m with a superdisc, dicussed by 
Gopal-Krishna. et alj (120071 ). A merger is also held re- 



sponsible for Z-shaped radio galaxies, where the secondary 
galaxy bends the .j et of the primary into a Z- s hape befor e 
the BHs coalesce (|Gopal-Krishna. et al.l 120031; IZierl I2005I) . 
for double-doub le radio galaxies ( Schoenmakers et alj 2000l ; 
iLiu et all l2003h and possibl y for compact symmetric ob- 
iects |zier fc Biermann|[2002l ) . Recently the sample o f know n 
XRGs has been increased considerably by ICheund ()2007h . 
This can be used for systematic studies and hence might 
support the merging scenario as formation mechanism of 
XRGs. Other sources suggest that the BHs have not yet 
merged and are still orbiting around each other. Helical jet 
patte rns could be explained in this way (|Begelman et al.l 



198fJ) as well a s sem i -periodic ch anges in lightcurves (e.g 



Sillanpaa et "all Il988l : iKat A Il997l ). However, this does not 



necessarily mean that the BHs are stalled, the binary might 
still decay. 

In a recent letter (fzierl 12006. from now on IPaper & wo 
showed that the slingshot ejection of stars in the second 
phase is efficient enough to allow the BHs to shrink to the 
third phase and coalesce within less than a Hubble time. Un- 
like in previous numerical simulations where the focus was 
on stars scattered off the binary, we focused on the stars 
bound in the potential of the BHs. The results showed that 
if the binary by the time it becomes hard is surrounded by 
a flat cusp with a power law index 7 < 2, as it appears after 
the merger and has been used in previous simulations, it will 
stall in this phase unless the cusp is very massive. However, 
we predict that the cusp is as massive as the binary and suf- 
ficiently steep (7 > 2.5) during the merger when the binary 
becomes hard. The ejection of this mass out of the poten- 
tial of the BHs extracts enough energy so that the binary 
can enter the third phase and the BHs coalesce. We argued 
that such a profile is formed during the merger. Parameters 
like the initial mass and velocity distributions in the iso- 
lated galaxies as well as the magnitude and orientation of 
both galactic spins and the orbital angular momentum rela- 
tive to each other have a strong in fluence on the merger an d 
the morphology of the remnant l|Toomre fc Toomrelll972l ). 
While the galaxies are merging energy is dissipated and an- 
gular momentum redistributed with some fractions compen- 
sating each ot her. Large amounts of mas s move on highly ec- 
centric orbits (|Rauch fc Tremainel fl996) in a potential that 
is stronlgy non-spherically symmetric. Low angular momen- 
tum matter accumulates in the center. Together with both 
cores which surround each BH a nd whose density increases 
considerably during the merger l|Barnes fc Hernquisdll996h 
this matter forms a massive and steep cusp by the time the 
binary becomes hard. This cusp is only transient because it 
will be destroyed by the merging binary and therefore is not 
likely to be observed. However, this should be the appropri- 
ate profile in order to simulate the second phase. 

In the present article, after repeating in Section [2] the 



results from lPaper J which we will need in this paper, we will 
justify in more detail our assumptions for the kick-parameter 
k (Section [3]) and the neglegt of the cluster potential (Sec- 
tion [4]). Afterwards we show that the fraction of mass which 
is required to become ejected is in agreement with the to- 
tal mass of the galaxy. We also show that the profile which 
at the beginning of the second phase is required to allow 
the BHs to coalesce is in agreement with the observed post- 
merger profiles after the binary has destroyed the cusp (Sec- 
tion [5j. In Section [6] we briefly consider the evolution of 
a merger in time before we compare the effects of multi- 
ple mergers on the ejected mass with numerical simulations 
in Section [7] Afterwards we examine observational evidence 
for ongoing mergers which might have become stalled (Sec- 
tion [H| and finally summarize our results in Section [5] 



2 PRELIMINARIES 

First we repeat the basic assumptions and some results from 
IPaper ll which we will use in the present article. It is assumed 
that the BHs, moving on Keplerian orbits, have formed a 
hard binary and that the origin coincides with the center of 
mass. We define the mass ratio q = mz/mi < 1. The total 
and reduced mass are M12 = mi + rri2 and /1 = m-imz/Mu, 
respectively. Hence, the energy of the binary is 



£"biii = — - 
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and the relative velocity between the BHs is 



GMvi 



(1) 



(2) 



where a is the semimajor axis of the binary. If the clus- 
ter mass M c is distributed according to the power law 
P = po(r / Vq) -7 between the radii r- x and r c with 
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(4) 



Note that in the following we will refer to the logarithmic 
slopes o f density profiles, dlogp/d logr, simply as slopes. In 
IPaper J . we showed that a mass of about 2M12 is bound to 
the binary, of which a large fraction is expected to be in the 
loss-cone. For stars belonging to this population, the initial 
energy in the potential of the binary is 



GM 12 m, 
2r- ' 



(5) 



where e < 1 is the eccentricity of the star's orbit and r_ the 
pericenter. We approximated the potential of the binary to 
first order with a point potential of mass M12 located at the 
center of the cluster. This introduces only minor deviations 
with a maximum of a factor of less than 2 for q = 1. In 
Section [4j we will show that it is justified to neglect the 
potential of the cluster itself when computing the binding 
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energy of the stars in Eq. ((5j). After its ejection the formerly 
bound star will have a positive energy which we can scale 
with the factor k to its initial energy for circular orbits (e = 
0): 



E, 



GMum* 
1 2r_ • 



(6) 



According to iQuinlanl (| 19961 ) the dominant contribution to 
the hardening of the binary comes from stars whose percen- 
ter is about the semimajor axis of the binary, independent 
of the density profile. Hence replacing r_ with a we obtain 
for the energy change E„ t f — E* t i of the star: 



AE t = (1 - e + k) 



GM 12 m, 
2~a 



Note that for pericenters smaller than a the initial energy of 
the star would be smaller and therefore the energy change 
larger, resulting in an increased kick-parameter k. 

In the limit m* <C mi we can replace m* with dm and 
write Eq. J7J in its infinitesimal form. Equating it with the 
change of the binary's energy in Eq. |T} due to the ejection 
of the mass dm yields the differential equation 



da \J^ rn 
a (jl ' 



(8) 



which relates the shrinking of the binary to the amount 
of the eje cted mas s. Integrating Eq. from a g to Oh we 
showed in lPaperll that the binary has to eject the mass 



/i a h 
= fc ln — 



(9) 



The semimajor axis where the binary becomes hard, ah, and 
where emission of gravitational waves starts to dominate the 
decay, a g , mark the transitions from phase 1 to 2 and phase 
2 to 3, respectively. While a g is well defined (see Eq. (|75[) ). 
there is no unique prescription for the semimajor axis where 
the binary becomes hard. However, the ratio of these dis- 
tances 77 = ah/a g is agreed to range from about 20 to 100, 
see the discussion in IPaperll and references therein. In the 
following we assume that the binary is hard and do not worry 
about the exact value of ah- We showed that the ejection of 
about m c j is sufficient for the binary to shrink from ah to 
a g , provided this mass is distributed according to a steep 
power law with an index 7 > 2.5. Therefore, we concluded, 
the coalescence of the BHs is very likely in the course of a 
galaxy merger where a large amount of mass with low an- 
gular momentum is accumulated in the central region. 



3 THE KICK-PARAMETER k 



One of our basic assumptions in lPaperll is a kick-parameter 
k = 1. In the literature we can find various prescriptions 
for k which we defined in Eq. (J7J). If we express the final 
energy of the ejected star in terms of its velocity at infinity, 
-Z5»,f = m.D^j/2 we can write k = (voo/v^) 2 , where we made 
use of Eqs. © and © and replaced the pericenter of the 
orbit of the star with the semimajor axis of the binary. This 
form can be used in the relation between the scaling param- 
eters obtained f rom Eq. 0, k = 1 — e + n. From scatter- 
ing experiments IQuinlanl jl996h finds that most of the stars 
are ejected with a final velocity Voc ~ (3/2)v M y mi/Mn = 



(3/2) Vfi \fq/{l + q) and hence we obtain 
9 a 



k 



■e+- 



1 



(10) 



IQuinlanl argued that the energy gained by a star can basi- 
cally be attributed to the interaction with the smaller BH 
in the limit mi 3> TO2, because the larger BH acts as a 
fixed potential. He then derives an expression for the en- 
ergy change which is proportional to mi/Mn = g/(l + q). 
However, because mi and mi are bound to each other also 
the acting forces correspond to each other so that the larger 
mass mi compensates for the smaller semimajor axis of its 
orbit. Therefore it does not seem to be justified to approxi- 
mate it as a fixed point potential compared to the potential 
generated by mi. The potential of the BHs is $i = Gmi/rt. 
As they move along their orbits a test mass m*, which is 
fixed in space, experiences a change in the potential which 
is proportional to the displacement of the BHs: 



„ Gm t 
d$i = — *-dn 



(11) 



We assume that the displacement aVi of the mass m; cor- 
responds to the semimajor axis of its orbit dj. Expressing 
this in terms of the semimajor axis of the binary a, i.e. 
ai = ag/(l + g) and ai = a/(l + g), we can write 



Gnu q 
d$i = — — a- 



Gm 2 1 
d$2 = — — a- 



'1 1 + g' " rl ~l + q- (12) 

The binary shrinks mostly due to the interaction with 
stars wh ose closest app roach corresponds to the semima- 
jor axis (IQuinlanl \W9& ). Hence we can replace n and ri 
with a, resulting in equal changes of both potentials, A«I> = 
(Gmi/a) q/(l + q) = v 2 fi/Mn- If the star was moving on a 
parabolic orbit and this energy is tranfered from both BHs 
to the star its final velocity is v~o = u M \j2\ijM\i so that 



2-^=2 q - 

M12 (l + g) 2 ' 



(13) 



This result has bee n cited before to have been obtained by 
ISaslaw et al. ( 1974) in numerical experiments. Apart from 
a factor 1/(1 + g), which has a minimum o f 1/2 for q = 1, 
this result is very similar to that obtained bv lQuinlanl above. 
Similar values have also been found before. In simulations 
of close encounte rs of stars with a h a rd eq ual-mass binary of 
zero eccentricity, iHills fc Fullertonl (|l98fj| ) obtained for the 
mean velocity of stars at in finity Voo ~ 0.84 v M , and hence 
k w 0.71. Later IRoosI (|l98ll ) performed numerical computa- 
tions with a varying mass ratio and approximated the kick- 
parameter with k = 2^,/ M11, the same result we have found 
above for k. For parabolic orbits, as we have assumed above, 
these parameters are equal and our crude estimate is in good 
agreement with his resu lt. 

More recently Izierl (|2000h simulated a stellar cluster 
bound in the potential of a BHB which is moving on fixed 
circular orbits. He carried out several runs for different mass 
ratios (g = 0.01, 0.1 and 1) and various initial density pro- 
files of the stars (Gaussian or power laws with index 7 = 2 
or 4). For every run we binned the initial eccentricities of 
the orbits and computed the kick-parameter for each bin. 
In Fig. [TJ we show the thus obtained k in dependency on 
the eccentricity and find a distinct linear correlation. We 
can not detect a clear dependency on the kind of the initial 
profile in the plot. However, the data show a weak positive 
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Figure 1. Data show that the kick-parameter is roughly a linear 
function of the eccentricity of the stellar orbit. While k tends to 
increase with q it does not seem to depend on the choice of the 
initial profile. The fit is drawn by eye using all data points. 



dependency of k on the mass ratio q. This is in agreement 
with the previous results given above in Eqs. (|10[) and (|13[) 
if we write k — 1 — e + k, although the dependency on q ex- 
hibited in the data is weaker. However, we find that k « 1 is 
a good approx imation for e < 0.4. Note that the simulations 
bv lZierl (|2000l ) did not take into account the potential of the 
stellar cluster and hence the real values for k should actually 
be larger than those displayed in Fig. [T] see next Section. 
Because Eq. (|10|) is less steep than the data in Fig. [T]it sug- 
gests this equation generally yields slightly larger values for 
the kick-parameter if the mass ratio is not too small. 

Co mparing our definition of the kick-parameter with 
that of[Yu| |2002l ) for K we find k = 2Ka/(l + qf . Making 
use of the results of iQuinlanl l|l99fj ). lYul obtains K ~ 1.6. 
This translates to a maximum value of k = 0.8 for 5=1, 
ro ughly in agreement with the previous results. Note that 
in IPaper II we did not include the factor = q/(l + q)' 2 

and so derived a too large value of k = 3.2. Because we just 
quoted this result and used k = 1 throughout the paper none 
of the results and conclusions obtained there are affected. 

In Eq. (O we defined k after having replaced the peri- 
center of the star r_ with the semimajor axis a of the binary. 
Because only stars with r_ < a can interact with the bi- 
nary and become ejected and a pericenter less then a would 
increase the kick-parameter (keeping the eccentricity con- 
stant) the values we derived should actually be a lower limit. 
In conclusion we can say that the above results clearly show 
that k is of order of 1 unless the stars are moving on very 
eccentric orbits (keeping r_ = a constant, i.e. stars are only 
weakly bound, what is very unlikely due to dissipation of 
energy during the merger) and th e mass r atio is very small. 
Therefore our choice of k = 1 in IPaper il was well justified 
and we continue to use this value in the present paper. 



3.1 The influence of the cluster potential on k 

After a star interacted with the binary it will be ejected 
from the potential of the BHs. This might not happen after 
the first interaction, but ultimately it will be ejected unless 
before the next encounter with the binary the pericenter is 
shifted due to star-star interactions to distances too large as 
to interact with the binary. Or the binary has shrunk in the 
meantime to a semimajor axis much smaller than the peri- 
center of the star, again resulting in no more interactions. 
However, this will happen most likely only in the beginning 
of the merger when the evolution is fastest (see Section [6} to 
stars whose energy is close to zero after the last interaction 
so that they have been almost ejected anyway. 

Interacting with the binary some stars will be acceler- 
ated to a speed which exceeds the escape velocity of the 
binary, but is less than the escape velocity of the combined 
potentials of the BHs and the cluster. These stars stay bound 
to the center. After multiple interactions with the binary the 
fraction of stars whose pericenters have become larger rel- 
ative to the semimajor axis of the binary for the reasons 
given above will remain bound in the cluster without in- 
teracting with the binary anymore. On longer time scales 
they might diffuse back into the loss-cone. The other frac- 
tion eventually becomes ejected from the total potential of 
the BHs and cluster after multiple encounters. Hence this 
delayed ejected fraction, emerging because of the inclusion 
of the cluster potential, increases the kick-parameter on av- 
erage. At radii r > r c the star's specific energy is 

G(M 12 + M c ) 



GM 12 



(14) 



(1 + /), 



where / = M c /Mi2, with / > 1. The escape velocity in the 
combined potential at r > r c is v 2 sc = 2(1 + /) GM12/V and 
for the velocity of the star in the potential of the binary only 
we can write v 2 (r) = 2(£* +GM\2/r). If we require that this 
velocity is at least as large as the escape velocity at r — r c we 
obtain for the specific energy the relation £ t > f GMi2/r c . 
Using this again in the expression for the star's velocity in 
the limit of an infinite radius yields v 2 ^ = 2£» > 2f a/r c . 
Therefore the condition that the stars become ejected from 
the combined potential of the binary and cluster can be 
written as 



K>2f-. 



(15) 



k is determined using only the binary potential. For a star 
which eventually escapes from the binary and the cluster this 
parameter is increasing with the normalized cluster mass /. 
In comparision with a neglected cluster potential (/ = 0) 
Eq. (|15p shows that including this potential increases k. This 
results in a larger kick-parameter of the delayed ejected frac- 
tion and therefore also of the mean value of k of all ejected 
stars. On the other hand we can use Eq. (|15p to derive a 
maximum cluster mass for which stars will be ejected, 

, s a«_ai±pi. 

With our results from the previous Section and assuming 
that r c 3> a this relation still allows the cluster mass to ex- 
ceed the binary's mass by a factor of a few, as required by 
a successful merger. Therefore most of the stars get a kick 
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large enough to escape from the center, even if we neglect 
the cluster potential. Consequently the fraction of delayed 
ejected stars is small, only slightly in creasing the mean of 
k. This is in agreement with lYul l|2002r ) who finds that espe- 
cially for a < oi, stars generally escape from the binary and 
cluster. We can summarize that including the cluster poten- 
tial tends to increase on average the value of the parameter 
k so that the values obtained above are rather lower limits. 



4 THE CLUSTER POTENTIAL 

We also have to check whether it is justified to neglect the 
cluster potential when computing the potential energy of 
stars bo und by the binary, our second basic assumption in 
I Paper J . If initially stars in the cluster are not bound by 
the BHs, they are at least bound by the cluster itself. The 
energy of a star before and after the interaction with the 
BHs is 

E, ti = y«i 2 -m($ bin + «I>c) (17) 

£,,f = y%-m($ bin + $ c ), (18) 

with $ c and $bin being the potential of the cluster and the 
binary, respectively. If we compare the initial and final en- 
ergy of the star at the same radius and assume that both 
potentials did not change during the time of the interaction 
between the binary and the star we can write for the change 
of the energy 



E, i 



- v i )• 



Taking the circular velocity in a point potentia l as the t ypi- 
cal velocity of a star in the cluster we showed in lPaperll that 
a mass of about 2Mra is gravitationally bound to the binary. 
Depending on the power law index of the mass distribution 
the radius of a sphere which contains twice the mass of the 
binary is (Eq. @) 



n> = n < 



1 + 2 



Mi2 



2M 12 /M C 



(ft) 



3-7 



1/(3-7) 



7/3 
7 = 3. 



(19) 



For a valid solution of course the relation M c > 2Mi2 must 
be satisfied. In this range the radius r b is increasing with 
decreasing 7, i.e. larger for flatter profiles. For 7 = 2 and 
r; <C r c we have r b = 2 r c Mi 2 /M c , independent of n. If the 
cluster is four times as massive as the binary and we assume 
r c w 100 pc we obtain for r b about 50 pc, i.e. a radius much 
larger than the semimajor axis ah where the binary becomes 
hard. For 7 = 3 and M c = 4Mi2 we find r b = ^/rlrv, i.e. the 
geometrical mean. With r; = 0.01 and r c = 100 pc this is 
1 pc. Thus a mass of 2M12 is contained in the central cusp 
and bo und to th e binary when it becomes hard. 

In I Paper J we assumed the star to be bound to the 
binary, i.e. E,.i is negative even if <E> C is neglected in Eq. (I17p . 
Then Vi is less than the escape velocity from the binary what 
is roughly true for stars within the sphere of radius ~ r b . It 
is this bound population on which we focused in our previous 
paper and which we also consider in the present work. When 
we calculated the energy of the stars and derived the mass 



which is required to be ejected in order to extract sufficient 
energy so that the BHs merge, we neglected the potential 
of the cluster. This we will justify here by comparing the 
energy of a star in the potential of the binary with its energy 
in the potential of a stellar cluster whose mass is distributed 
according to a power law with index 2 or 3. 

In case of p = po(r /ro)~ 2 we can write Poisson's equa- 
tion as 

V 2 $ c =47rGp = s/r 2 , (20) 

where we have introduced the constant s = 47rGpo- The 
stars in the cluster are distributed between n and r c , the 
inner and the cluster radius, respectively. With M c being 
the total mass of the cluster we can write s = GM c /(r c — n). 
Integrating Eq. (|20jl twice we obtain for the potential in the 
range n < r < r c 



$ c (r) = $ c (n) + s 



, r n 
In — + - 

Vi r 



9£ 

dr 



(21) 



Because the mass is spherically symmetric distributed and 
there is no mass within n, no force is acting on a particle in 
this range. Therefore F — —d§/dr has to vanish at r — n. 
On the other hand the force acting on a particle outside the 
cluster is the same as that of a pointmass M c located at 
the origin, -GM c /r (Newton's second theorem). Evaluat- 
ing this condition at r = r c we can write the potential in 
the form 



-GM C I 



1 In Is. 

r c — n rj ' 



r c — n 



1 



< r < n 
n < r < r c 
r c < r. 



(22) 



The potential energy of t he cluster in its own potential is 
jBinnev fc Tremaine|[l99l . page 34) 



E c = 2n 



J r 2 p{r)<b{r)dr. 



(23) 



We are only interested in the stars which become ejected 
after interacting with the binary, i.e. stars in the range be- 
tween a g and ah ■ It is the energy of this fraction in the poten- 
tials of the cluster and the binary that we want to compare. 
Therefore we have to integrate Eq. (|23p in the limits from 
a g to ah and obtain 



E c =- 



GM 2 



1 



2r c (1-1/0 2 
2 77 — 1 In 77 



+ 



In A ln(Ar?) 



A r\ £ A Xr/ 
In this expression we used the following definitions 



a„ 



A ; 



which can be combined to 



n 



A77' 



r c 
Oh ' 



1 



(24) 



(25) 



(26) 



The potential energy of this fraction of the cluster (a g < r < 
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£"bin = 



Oh) in the potential of the binary, which we approximated 
by that of a pointmass M\ 2 at the origin, is 

GM 12 M C a h _ GM 12 M C In 77 
r c — n a g r c 1 — 1/C 

This energy is obtained by multiplying the binding energy 
m Eq. (11) of lPaper J with the factor 2. We assume that 
the mass of the cluster within the distance ah corresponds 



(27) 



M (oh) = M, 



oh — n a h 

: = r ln — ■ 



(28) 



Solving for the cluster mass and using the definitions in 
Eq. I|25p we obtain 



M c = H A ^iln,. 



(29) 



As we argued in IPaper J solutions with n < a g are phys- 
ically unreasonable because the mass in this range is not 
included in the mass which is interacting with and ejected 
by the binary. Therefore we assume n = a g in the following, 
implying £ = Xr). The ratio of the potential energies of the 
mass in the cusp range (o g < r < ah) then results in 



E c 



This ratio is plotted in Fig. [2] as a function of A with the 
mass ratio g as parameter (bold lines) and k = 1. Since 
the dependency on 77 in Eq. (|30[) is very weak in the range 
20 < 77 < 100 we plotted the ratio only for 77 = 50. The 
figure clearly shows that the stars are bound much stronger 
in the potential of the binary than that of the cluster. The 
ratio increases with A, but only for very large A the energies 
become comparable, i.e. A ~ 500 and A ~ 5 x 10 9 for q — 1 
and 0.1, respectively. Such large clusters result in masses of 
about 2000/i and 2 x 10 10 /i for q — 1 and 0.1, respectively, 
which are unrealistically large. Thus, even for major mergers 
it is justified to neglect the potential of the cluster with an 
index 7 = 2 in order to compute the ejected mass that allows 
the B Hs to mer ge. 

In I Paper J we showed that while the binary probably 
does not decay into the third phase if the density profile 
is as flat as 7 = 2, it will enter the final phase for steeper 
profiles. Repeating the above analysis for a power law with 
the index 7 = 3 we obtain for the cluster potential 



*c(r) 



GM C 



1 



1 



ln(r c /n) V n r c 

1 (x-JL 

ln(r c /n) V r c 



< r < n 
ri < r < r c 



7V < r. 



(31) 



and hence for the energy of the cluster in its own potential 
in the range from a g to ah 



E c = 



GM, 



1 



, (1 - 77 + In 17) . 

r c (In 77) 1 



(32) 




Figure 2. The energies of the cluster in the potential of the 
cluster itself and the binary as a function of the relative cluster 
size A = r c /ah- The bold and thin lines are for flat (7 = 2) and 
steep (7 = 3) profiles, respectively, (k = 1, r\ = 50.) 



Assuming as before that the mass m c j is distributed in the 
cluster between a s and ah we obtain from Eqs. (|4]) and <(9j 

In 77 



fc lnC ln«/A)- 



For n — a s (i.e. £ = A77) the ratio of the energies is 



1 

2k 



(1 + ciV 



2- 



(A + l)ln77 



(34) 



(35) 



A(r?-1) 

This ratio is displayed in Fig. [2] by the thin lines. We can see 
that for the steeper profile the cluster potential contributes 
an even smaller fraction to the potential energy of the stars 
and increases much less with A (i.e. is almost constant) than 
in the case of a shallower profile (bold lines). The steeper 
the density distribution, the stronger the stars are bound 
to the BHs. For a density distribution with a power law in- 
dex as steep as 7 = 3 the contribution of the self energy 
of the cluster is negligible, even in the limit of large ratios 
A = r c /ah. The term in the square brackets of Eq. (|35|l then 
tends to 2 — ln(77)/(?7 — 1), having a maximum of 2 if 77 tends 
to infinity so that the ratio E c / '-Ebin approaches a maximum 
of g/(l + g) 2 , what is 0.25 and only 0.083 for g = 1 and 
0.1, respectively. While neglecting the cluster's potential is 
well justified for density profiles with 7 = 2 it is an even 
better approximation for s teeper m ass distributions. Thus 
the results we obtained in IPaper j should be a reasonably 
good approximation in the limit that the cluster contains a 
mass within the radius ah which corresponds to about the 
binary's mass. Therefore we can be confident in our results 
which predict a successful merger of the BHs after the ejec- 
tion of about m G j. 



In the same range the energ y of the cluster in the potential 
of the binary is (Eq. (11) of [Paper il multiplied with 2): 5 MASS AND DENSITY PROFILE 



GM U M C 

ln(r c /r ; ) \a 



1 



GM 12 M C 77 - 1 
a h InC 



The observed profiles of early type galaxies have been pub- 
' ' lished in various papers. lLauer et al.l l|l995t ) casted the sur- 
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face brightness distributions into the 'Nuker-law', i.e. two 
power laws with inner and outer slopes which match at 
the break radius Rb- This is typically found on scales of 
some tens of parsecs or more. They detected a bimodal 
distribution and classified sources with slopes in the range 
< (5 < 0.3 as core galaxies while they referred to steep 
profiles with /3 > 0.5 as power law galaxies. Depro j ecting 
the surface brightness distributions iGebhardt et alj (|l996 ) 
found the slopes of the luminosity density profiles to be in 
the range 0.3 < 7 < 2.5 peaking at 0.8 (core galaxies) and 
1.9 (power law galaxies). Because the outer slope of both 
types is less than 3 it has to steepen again at large radii 
to keep the mass finite. This is not covered by the Nuker- 
law. Both classes of galaxies also differ in other respects 
l|Lauer et al.lll995l) : Core galaxies have larger cores and are 
more massive and luminous with boxy or elliptical isophotes. 
They show a high velocity dispersion while they are slowly 
rotating. This is in agreement with this type of galaxies hav- 
ing undergone a major merger with redistribution of matter, 
dissipation of energy and cancellation of large amounts of 
angular momentum. This probably results in an increased 
central de nsity at t he tim e the binary b e comes hard, as we 



argued in Paper 1. Later Carollo et alj (|l997T ); iRest et all 



(2001); iRavindranath et all ^POlF " found galaxies with in- 



termediate slopes what might suggest that there is a smooth 
and continuous variation in the slopes and other properties 

of the galaxies. 

More recently iGraham et al.l (|2003T l suggested a com- 
bination of a power law and Sersic law to fit the surface 
brightness profiles. The new 'Core-Sersic' model seems to 
generate better fits to the observed distribution, including 
the outer regions, and thus keeps the total mass finite. Ac- 
cording to this model the galaxies previously classified as 
'power laws' can be fitted by a pure Sersic profile, while 
core galaxies follow a flat power la w in the inner region s and 
a Sersic profile at larger distances (|Truiillo et al . 2004). The 
slopes of the inner power law are 20-40% larger compared to 
the Nuker-fits, while the break radii are smaller by a factor 
of ~ 2-5. Thus the distribution of the slopes of the spatial 
profiles of the core galaxies peaks at about 1.14. Assuming 
a constant mass to light ratio we can use these models and 
their slopes for the density profiles. 



5.1 Ejected mass &: cluster mass 

If the mass m e j is distributed between a g and ah according 
to a power law with index 7 = 3 its ejecti on allow s the 
BHs to shrink to the final separation at = a g l|Paper J ). For 
flatter profiles we find at > a g while for steeper ones the BHs 
shrink into the range of phase 3, see Fig. 1 in that article. 
We also calculated how much mass m rq exactly is required 
to be distributed between a g and ah for other slopes than 
7 = 3 in order to allow the binary to enter the final phase. 
This mass, expressed as a fraction of m c j , is 



2-7 1 



r, - 



m c j 3 — 7 1 — r)i~ 2 In 77 



(36) 



such that for 7 = 3 the ratio is equal to 1. If the amount 
of mass which is distributed in the cusp region, i.e. between 
a g and ah, is fixed the evolution of the merger in this phase 
and hence at does not depend on the extension of the cluster. 
However, changing the parameter A — r c /a,h will influence 
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Figure 3. Bold lines, left-hand y-axis: The cluster mass in units 
of m e j is shown under the condition that the mass within ah is 
large enough to allow the binary to merge. Thin lines, right-hand 
j/-axis: fraction of the cluster mass within ah- 



the total cluster mass. Scaled to the mass which is contained 
in the cusp, M(ah), we can express it as 



M c 



(Ar?) 



3-7 



1 



M(a h ) 



n3-7 



(37) 



assuming that the density distribution at larger radii has 
the same slope as in the cusp region. Deriving this mass 
ratio from Eq. Q we again assumed that n = a g so that £ 
can be replaced with X-q. If we require M(ah) = m rq we can 
express the resulting cluster mass with the help of Eq. (|36[1 
in units of m e j. This is plotted as function of the index 7 
in Fig. [3] (bold lines, left-hand t/-axis) . With m e j being of 
the order of 10 s Mq for a binary of a comparable mass (see 
Eq. ((Hi), Fig. [3] shows that for the BHs to merge the cluster 
has to be of about the same mass in case of steep profiles. 
The cluster mass is increasing with decreasing 7, the more 
steep, the larger the cluster is. It amounts to about 10 10 Mq 
if 7 w 1.25, 1.6 or 2.1 for A — 5, 10 or 50, respectively. 
Thus for large clusters with flat density profiles, as seen in 
core galaxies which are thought to have already undergone a 
major merger, the mass quickly becomes unphysically large, 
i.e. the binary would stall. But if the distribution is steep 
enough as we assume for an ongoing merger (7 > 2) this 
problem does not occur. 

Keeping the cluster mass fixed to m e j and plotting the 
inverse of Eq. (|37[) we obtain the fraction of m c j that is dis- 
tributed within ah and therefore accessible for ejection by 
the binary (thin curves in Fig. O scaled on the right-hand 
y- axis). We get the same curves if we restrict the cluster 
mass to m rq instead of m c j and then plot the cusp mass 
within ah in units of m rq . The mass fraction of the cusp is 
increasing with the slope. For a small 7 the decay of the bi- 
nary stops before the transition to the third phase is reached 
because there is not e nough mass available for ejection (see 
also Fig. 1 of iPaperll) . The larger A is, the less mass is con- 
tained within this range and the earlier the binary stalls. 
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For steeper distributions with 7 > 2 the fraction of ejected 
mass increases steeply with 7. Even for a cluster as large as 
A = 50 this mass is sufficient for 7 « 3 to allow the binary 
to shrin k to a radius as small as about 2a g (Eq. (12) of 
iPaper J) . 

Comparing the ejected with the cluster mass we come to 
the same conclusion as before. For large clusters with shal- 
low profiles its mass becomes unphysically large if the binary 
is supposed to enter the third phase. Therefore the decay of 
the binary would stall in core galaxies, which probably have 
already undergone a major merger. This i s in go od agree- 
ment with the conclusions drawn by iRoosI l|l98ll ) from his 
numerical three-body experiments. However, for steep pro- 
files as we expect them to be formed during mergers, there is 
sufficient mass available for ejection without the cluster mass 
becoming too large, so that the BHs will coalesce. The post- 
merger profile has to match the observed mass distributions 
and comparing both we hope to obtain more information 
about the properties of the cusp. 



5.2 Mass distribution during and after the merger 

In Eq. 1(3} we can express the infinitesimal mass in terms of 
the density, dm — Aur 2 p(r)dr, and rewrite this equation in 
the form 



dm u , 2 1 \ 



(38) 



Solving for the density distribution of the cusp we obtain 
W« = ^" S . ( 39 ) 



i.e. a profile with a power law inde x 7 = 3. This distribution 
represents the solution in Fig. 1 of |Paper~3 . where for 7 = 3 
and A = 1 all lines go through af/a s = 1. This means that 
m c j is distributed with just the right steepness that after 
its ejection the binary has shrunk to the semimajor axis a g 
where gravitational radiation starts to dominate the further 
decay. With the slope 7 = 3 the BHs eject the mass dm 
in the distance r owing to which the binary shrinks by an 
amount dr that corresponds precisely to the thickness of 
the spherical shell containing dm. If the density would fall 
below that of Eq. (|39[1 somewhere in the range between o g 
and ah and we ignore the mass outside the shell Anr 2 dr, the 
binary would stall at this distance. For a steeper distribution 
more matter is bound deeper in the potential at smaller radii 
and its ejection would allow the binary to decay to radii 
smaller than a g . If we assume that Eq. (|39[) is the initial 
profile, all the mass becomes ejected for r < and a hole 
remains in this range after the merger. But if the initial 
profile exceeds that of Eq. (|39|) we could substract the latter 
from the former to compute the density distribution after 
the BHs have merged. 



Non-core profiles 

As iTruiillo et alj (|2004l ) showed, a Sersic model is a very 
good approximation to surface brightness profiles, and as- 
suming a constant mass to light ratio we can write for the 
surface density: 



£(7?) = £ exp 



l/n 



(40) 



where R is the 2-dimensional radius and £0, R e and n are 
parameters. While £0 is just a scaling factor we can re- 
late R e to n if we require that a disc of radius R c con- 
tains half of the total mass. Integrating Eq. (140 [) from the 
center to R c and infinity the comparision of the obtained 
masses yields the relation T(2n) = 2T(2n,b). F with one 
argument denotes the gamma function and with two argu- 
ments the i ncomplete gamma function, w here we used the 
notation of lAbramowitz fe Stegunl (ll972Tl. A good approxi - 
mation to this relation is given bv lPrugniel fc Simienl (11997) 
with b = 2n - 1/3 + 0.009876/n. They also deprojected the 
surface density profile and gave a simplified fit to the spatial 
density distribution which can be written as 



Pgai(r) = pa ( — j exp 



. Rc 



l/n 



(41) 



This is a Sersic model multiplied with a power law where 
po — £o/-R G and r is the 3-dimensional radius. The 
surface integral of Eq. (|40p should give the same total 
mass as the volume integral of Eq. (|4ip and we obtain 
T(2n) = 2b n(p ~ 1) rfn(3 - p)]. T h is allo ws to compute p 
as function of n. iMarquez etail ll2000h give an update d 
version of the approximation by iLima Neto et al.l \ 19991) . 
p = 1.0 - 0.6097/n + 0.05563/n 2 . Note that the half mass 
radius of the spatial distribution is slightly larger than of 
the surface profile , r c /R c = 1.356 - 0.0293/n + 0.0023/n 2 
l|Lima Neto et ail) . 

Non-core galaxies do not show signs of a recent merger 
like core galaxies. On the other hand the innermost data 
points used for fits are larger than ah, and therefore a tran- 
sition at this distance to a possible inner power law could 
not have been resolved. By the time the binary becomes 
hard we assume the mass to be distributed according to one 
of the following models: (a) According to the deprojected 
Sersic model, which describes the resolved part very good 
and continues farther down to the center. After the BHs 
have merged we have to substract the ejected cusp from the 
deprojected Sersic profile, which therefore has to be at least 
as large as that of the ejected cusp at r = a g . In model (b) 
the mass distribution is that of a deprojected Sersic model 
with an additional cusp in the center when a — ah, leav- 
ing behind a deprojected Sersic profile after the BHs have 
ejected the cusp. The distribution of the ejected mass frac- 
tion in the cusp region for both models is: 



(42) 



Pi = Ph\ — 

\a h 



It has to match Eq. (|41|l at r — a g and ah for model (a) and 
(b), respectively. The volume integration of Eq. (|42p over 
the cusp range yields the ejected mass fraction M c j . 

Equating both mass distributions at r — a g we obtain 
for the galaxy profile of model (a): 



Pgal,(a) — Ph 



■ exp 



1 

T)X e 



(43) 



with x = r/ah and x c = R c /a^. ph is obtained from Eq. <(3j 
using ro = r c = Oh, n = a g and M c = M B j. We denote the 
term on the right hand side of the curly bracket of Eq. ([3| 
with 3(7,??) and thus find ph = 3(7, Tj) M j/(47ra h ). The vol- 
ume integration then yields for the total mass of the galaxy 
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Figure 4. Mass of a non-core galaxy as function of the slope 7 
of the ejected profile. x c = 10 3 and m c j fa 10 s Mq. The bold and 
thin lines in model (b) are for r\ = 20 and 100, respectively. 



in units of the ejected mass 



r 3-p 



6"(3-p) 



< ? ( 7 , ?7 )r[n(3-p)] e ,,(1/ ^ )1/ " (44) 



For model (b) with the condition p 7 (ah) = Pgai(ih) we ob- 
tain for the density and the mass: 



Pgai,(b) = Ph— exp \ -b 



and 



gal,(b) 



r 3-p 



g(j,r])r[n(3-p)] e 



b(l/xc) 1/rL 



(45) 



(46) 



While the density given in Eq. (|43p refers to the initial pro- 
file from which we have to substract the cusp of Eq. (|42|) to 
obtain the final distribution, Eq. (|45[) is the expression for 
the final profile after the ejection of p 7 . Because the ejected 
mass is only a minor fraction of the total galaxy mass it 
does not make a difference whether we compare the initial 
or final galaxy masses. Note that they are actually lower 
limits for a successful merger. Assuming that M e j = m rq of 



Eq. (|36|l we plot the galaxy mass in units of m e j as function 
of the inner slope 7 in Fig. \$\ For all plots we used a ra- 
tio R c /a h = 10 3 because [Truiillo et all (|2004T ) obtained for 
the fits of non-core galaxies half-light radii typically on the 
kpc scale. While we obtain ascending curves (solid, dashed, 
dotted) for model (a), they are descending (short dashed, 
dash-dotted) in case of model (b). If we keep the ejected 
mass of the cusp constant between a g and ah but increase 
the slope 7 of its distribution the density increases at the 
inner edge, being compensated for by a decrease at the outer 
edge. Because the densities of model (a) and (b) match the 
cusp at a g and ah, respectively, we obtain the different de- 
pendencies on 7. For galaxies with a mass of some 10 12 Mq 
or less Fig. U shows that possible solutions for model (a) 
with 7 > 2 require a large shape parameter n > 7. We also 
find small cusps (77 ~ 20) to be more likely than larger ones. 



Figure 5. Density profiles. Model (a): The dotted lines with pa- 
rameters 7 = 3 and 2 (upper and lower pair, respectively). Initial 
and final profiles are the upper and lower branch of each pair. 
n = 10, r) = 50. Model (b): solid lines from top to bottom: initial 
profiles with 7 = 3, 2 and final profile for n = 4. Dashed line: final 
profile with n = 4. A core profile is shown by the thin dash-dotted 
line with n = 6. 



If on the other hand an additional cusp is formed in the 
center when a = ah we do not find tight restrictions for the 
parameters. In the range 7 > 2.5 all shape parameters n > 4 
are possible. The size of the cusp r\ (20 and 100 for bold and 
thin lines, respectively) has only a small infl uence on the 
total mass of the galaxy. iTruiillo et al.l (|2004T ) obtained for 
their best fits to non-core galaxies an average shape param- 
eter of about 5. This basically excludes model (a) and is in 
very good agreement with steep cusps (7 > 2.5) in model 
(b) so that shallower profiles seem to be less likely. 

In Fig. [S] we plotted the density profiles for both mod- 
els. The upper branch of both dotted pairs of curves shows 
the initial profile of model (a), i.e. a deprojected Sersic model 
distribution with n = 10 and r\ = 20. After the BHs have 
coalesced and stars have been ejected from the center we ob- 
tain the lower branches as final profiles. For the upper and 
lower pair we used 7 = 3 and 2, respectively, for the ejected 
distribution. In the latter case we multiplied the final profile 
in the region r < ah with a factor of 1.5 in order to avoid 
a kink at r = ah. This does not matter because Fig. [S] is 
meant to only give a rough idea about the possible shape of 
the profiles. Model (b): Both solid (n — 4) upper branches 
depict the initial profile for 7 = 3 (top) and 2. The lower 
branch is the final profile which is a pure deprojected Sersic 
model and extends to the outer regions. The dashed line 
shows the final distribution for n = 10, which has a slope 
almost as steep as 2. The ratio of the inital and final mass 
contained in the cusp region varies between 1.8 and 3.3 for 
the parameters (77, n) = (20, 10) and (100,4), respectively, if 
7 = 2.5. For 7 = 3 we obtain a ratio between the limits 3.5 
and 8.5 for the same sets of parameters. Hence this amount 
of mass has to be added to the mass of the relaxed profile in 
the cusp region during the merger if the BHs are to coalesce. 

Our results suggest that if non-core galaxies are merger 
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remnants then model (b) is more likely than (a) and a cusp 
has been formed during the merger with an index 7 > 2.5 
when a = ah, which is subsequntly removed by the merging 
BHs. This is also what we expect: It would be quite unlikely 
that an undisturbed deprojected Sersic profile is maintained 
during the merger, especially when both cores merge and 
the binary becomes hard, without the formation of an inner 
cusp. 

Core profiles 

The surface brightness of core galaxies is well fitted by a 
Sersic model in the outer parts and by a flat power law at 
distan ces smaller than the break radius Rb, see lTruiillo et al.1 
To keep things simple we will not try to deproject the 
core-Sersic profile. Instead we use a power law with the index 
S in the core region below the 3-dimensional break radius rj,, 
which we identif y with Rb, and a deproject ed Sersic profile at 
larger radii, like lTerzic fc Graham! (|2005l ). For an infinitely 
sharp transition between both regimes the densities in the 
core and outer galaxy region can be written in the form: 



Pcorc — pb 



Pgal = Pb \ — ] < X|) 



(47) 
(48) 



Here x denotes the radius in units of ah so that Xb = ri/ah 
and x e = R e /ah- Both pro files match at Xb with the density 
Pb- Because iTruiillo et al.l found the slope of a core-Sersic 
approximation to be about 30% steeper than Nuker fits in 
the core region we use 8 — 1.14 for the de projected slopes 
instea d of 0.8 what has been obtained by iGebhardt et al.l 
(1996) when deprojecting Nuker fits. For the seven core 



galaxies of the sample of Truiillo et al.l the average ratio of 
the half light radius and the break radius is R c /Rb ~ 70. We 
assume that this ratio is roughly maintained for the depro- 
jected spatial radii. Because R a in this sample is typically on 
the kpc scale so that x B = 10 3 we find for the spatial break 
radius Xb ~ 15, in units of ah- For a flat core to be able to 
exist within rb its slope has to be less steep than the slope 
a of the deprojected Sersic profile evaluated at x = x^- 

l/n 

n / .1 i 

'+ ■ 



d loe 



Pgal 



d log x 



(49) 



The condition a > S is always fulfilled for n > 2 and hence 
imposes no restrictions on our model. The break radius is 
larger by a factor of about 10 compared to ah, which is 
on the parsec scale and thus very close to the radius of the 
innermost data point which has been used in the fits. Hence, 
if the profile of core galaxies is about the same at r > ah 
before and after the merger, it will have an additional cusp 
p 7 at r < ah when the binary becomes hard. By the time 
the BHs have coalesced we assume this cusp to have been 
removed and the profile of the core region (ah < r < rb) to 
extend also into the cusp region. Matching both profiles at 
x = 1 we obtain pb = PhX^ s and find for the initial profile 
&t a g < r < ah- 

(50) 



Pcusp — P~{ ~\~ Pcorc — Pl\ yX -\- X 

The complete mass distribution of such a core galaxy from 
the cusp to the outer galaxy region is depicted by the thin 



dash-dotted curve in Fig. [5] for the time when a = ah- The 
used parameters are 7 = 3, 6 = 1.14, Xb = 15, x c — 10 3 and 
n — 6. Performing the volume integrations of all the compo- 
nents of the density profile over their respective regions we 
obtain for the masses: 

J-31 
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3-S 



M core = M e j 3(7,77) ■ 
A/ g al = M c j p(7,77) 
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l/n 



(51) 
(52) 

(53) 



As before we used ph = 3(7, 77) M e j/(47rah) with 3(7,77) 
being the term right of the curly bracket in Eq. ((3} with 
ro = r c — ah and n = a g . For a successful merger we identify 
M e j with 77i rq of Eq. (136[) and plotted the total mass of the 
galaxy M to t = M cuap -|-Mcore-|-Mgai in units of ra c j ~ 10 s M 
as function of 7 in Fig. HJ] We used Xb = 15, x c = 10 3 and 
S = 1.14. The bold and thin lines are for 77 = 100 and 20, 
respectively, and from top to bottom we used as shape pa- 
rameter n = 4, 6 and 10 for each set of curves. The figure 
shows that the lower mass limit that enables the BHs to 
merge decreases with increasing 7. Because the cusp profile 
has to match the flat core distribution at r — ah, which in 
turn matches the deprojected Sersic model at r = rb, the 
mass of the core and galaxy region decreases with increas- 
ing 7. In fact, for flat cusps with 7 < 2 the lower mass limit 
might actually be too large. While for 7 < 2.7 small cusps 
with 77 = 20 result in smaller li mits than 77 = 100 we find 
this reversed for steeper slopes. ITruiillo et al.l found shape 
parameters of n ~ 5 what argues for 7 > 2.5 and a transient 
cusp if the total mass should not exceed 10 4 m e j . 

Instead of the deprojected Sersic model we also used the 
Sersic law of Eq. (|40p for the spatial density distribution at 
r > rb- We just replace R and R e with r and r e , respectively, 
and substitute Eo with po- If we use a three-dimensional 
Sersic model at all radii the condition that a sphere of radius 
r c contains half of the total mass leads to a very similar 
relation as in the 2-dimensional case between the factor 63 
in the exponent and the shape parameter n, 2r(3n, 63) = 
r(3n) . This can be a pproximated by 03 = 377,-1/3+0.0079/77 
l|Merritt et al.|[2006l ). While the cusp and the core are the 
same as before we obtain for the mass in the region r > rb 
with the 3-dimensional Sersic law 



M 



3n, 03 ( — 

Xe 



1 1 r, 



b 3 (x b /x c ) 1/r ' 



(54) 

This expression is also obtained by simply setting p — and 
replacing b with 03 in Eq. (|53[1 . The lower limit for the total 
mass of this model is shown by the dotted lines in Fig.|6]and 
is typically larger by a factor of a few compared to the depro- 
jected Sersic model. Hence the latter profile might be more 
appropriate for the mass distribution in galaxies which is de- 
rived from surface brightness profiles, while the former seems 
to describe galaxy sized dark matter halos slightly better 
l|Merritt et al.l l2006). On the other hand we can not iden- 
tify the shape parameters of both models with each other 
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Figure 6. Total mass Mtot = M CU sp + M CO re + M ga i as func- 
tion of the slope 7 of the ejected profile. Fixed parameters are 
(J, Xb,x c ) = (1.14, 15, 10 3 ). Each set of three curves is, from top 
to bottom, for the shape parameters n = 4, 6 and 10. The solid 
lines are for the deprojected Sersic profile and dotted lines for a 
three-dimensional Sersic profile with r) = 100. m c j fb 10 s Mq . 



and would expect n of the 3-dimensional Sersic model to 
be slightly larger than n of the deprojected 2-dimensional 
Sersic profile of the surface brightness. Thus the resulting 
curves should be closer to each other than shown in Fig. HJ] 
For both, non-core and core galaxies we find in Figs. 2] 
and [5] that for a sucessful merger generally less mass is re- 
quired for large n. How e ver, o n average the sources con- 
sidered bv lTruullo et al.l l|2004l ) have a shape parameter of 
about 5 which is in favour of a steep cusp which is transiently 
formed (model (b) for non-core galaxies). If core galaxies 
are merger remnants such a cusp has been removed during 
the second phase. Constructing their profiles with an addi- 
tional cusp at the time when the binary becomes hard yields 
galactic masses which are about 10 12 Mq or less for n « 5, 
provided that the slope of the ejected mass distribution is 
7 > 2.5. This confirms our conjecture that a steep cusp is 
formed transiently in the central regions and is in favour 
of core galaxies being rather the end product of a merger. 
However, in this simple model we would expect the break ra- 
dius in the merger remnant to roughly coincide with ah, the 
distance where the binary becomes hard. This is not clearly 
defined and ah might actually be larger than assumed here. 
Maybe before the binary becomes hard the inner parts of the 
cluster surrounding each BH are so deeply bound in its po- 
tential that they increase the effective mass of the BHs. That 
would result in sligshot ejection of stars at larger radii before 
the naked BHs form a hard binary, i.e. a shift of ah to larger 
distances. Increasing the size of the cusp at the expense of 
the core region, i.e. increasing ah while keeping rj, constant, 
would shift the matching point of both profiles to larger radii 
where the steeper cusp profile has decreased to smaller val- 
ues, see the core profile in Fig. [S] Consequently the mass of 
the matching core and galaxy region would decrease below 
the values shown in Fig. [6] so that Mtot becomes ~ 10 4 m e j 



even for y m 2. It is also possible that the difference between 
Oh and Vb, of about a factor of 10, is caused by relaxation 
processes. We would expect a merger which shifts matter 
from the region inside ah to larger distances also to change 
the steepness of the profile as well as to smooth out sharp 
transitions. In the post-merger profile the former transition 
between the ejected and remaining distribution might be 
much closer to the observed rb than ah. Eccentric orbits will 
also result in larger ah, see the next Section. 

In some galaxies the inner density profile has 
been observed to actually decrease towards smaller radii 
|Lauer et al.ll2002h . Depending on how close the ejected pro- 
file approaches the initial distribution all slopes less then 
the initial one, even holes in the profile, are possible in 
the merger remnant. The density distributions shown by 
lLauer et al.l do not resemble a power law in the inner parts 
and are more similar to the profiles resulting from simula- 
tions of a fixed binary embedded in a stellar cluster (Fig. 1, 
IZier fc Biermannl|200ll ). Although these profiles would be in 
agreement with a stalled binary they could also be gener- 
ated by completely merged BHs, depending on the initial 
distribution. 

5.3 Possible deviations from the profile 

According to our analysis the profile fulfills the following 
conditions when the binary becomes hard: It allows the BHs 
to merge completely without the cusp and galaxy becoming 
too massive and is generally in agreement with observed 
post-merger profiles after a fraction of the cusp has been 
ejected by the binary. In the cusp region the profile is steep 
(7 > 2.5) turning into a power law at ah with index S ~ 1 
before it becomes a deprojected Sersic profile at rb ~ 10 ah. 

If the orbits of the stars in the cluster would be eccen- 
tric instead of circular, as we assumed, and we keep the peri- 
center fixed, the kick-parameter would be smaller (Eq. @). 
This would result in a larger mass which has to be ejected for 
coalescence (Eq. (O), but also in a less steep cusp. Stars with 
apocenters r+ > ah and pericenters r_ < ah will probably 
be close to r+ , where they spend most of the time of the or- 
bital period, when the binary becomes hard. Half a period 
later they interact with the BHs and can become ejected. 
These stars could compensate for the additional mass which 
is required for a successful merger for cusps with 7 < 3. The 
ejection of these stars shifts ah to larger radii and helps to ex- 
plain the difference between ah and the break radius because 
their apocenters populate the region between ah and rb- For 
apocenters r+ > the eccentricity would become larger 
than (rb — a^)/(rb + a,h) ~ 0.8, what might be unlikely. How- 
ever, the exact shape of the initial profile will depend on the 
mass and velocity distributions in the isolated galaxies prior 
to their collision as well as on the orientation of both galactic 
spins and their orbital angular momentum relative to each 
other. The amount of dissipated energy, cancelled compo- 
nents of angular momentum and mass with low angular mo- 
mentum which is funelled into the central regions depend 
on these parameters. Each of the BHs will carry a stellar 
cusp of about its own mass. By the time the BHs become 
hard both cups will have merged and together with other 
matter that has accumulated in th e center they form a new 
massiv e and steep cusp. Fig. 3 of iMilosavlievic fc Merrittl 
|200l[ ) suggests that the slope of this cusp is substantially 
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steeper than 7 = 2. However, it needs to be clarified whether 
this is physical or due to spurious numerical relaxation. The 
post-merger profile will also be influenced by the time scale 
on which the binary decays in the second phase (Section [6]). 
This will depend on the eccentricity of the BHB and also on 
the initial distribution of the mass and velocity of the stars 
in the cusp. 

Our derivation above for the density distribution might 
be oversimplified because of the following possible devia- 
tions: A fraction of stars which has been ejected from the 
region r < ah will stay bound to the cluster, increasing the 
density at larger radii (Section I3.1|l . The binary also heats 
the remaining stellar population at r < ah, further dimin- 
ishing the cluster's density in this range. Due to the mass 
transfer from the inner to the outer regions of the cluster the 
stars which remain in the center are not as tightly bound as 
before and consequently extend to larger radii. All this redis- 
tribution of mass leads to a profile which is shallower than 
the difference between the initial and the ejected mass dis- 
tributions. However, computing the difference between the 
core-Sersic model and the Sersic part extrapolated into the 
core region, iGraharr] (J200J) obtains mass deficits between 1 
and 2 times M12 , depending on which method is used to de- 
termine the post-merger BH mass. Using Eqs. @ and !f55|) 
we find for the ejected mass which allows the binary to enter 
the final phase: 

q 2 '"V 1 ^ 

= JT+qT 3^ 1 - 5F 5 {n ~ 1} ' (55) 

This ratio is displayed in Fig. [Jj as function of q with the 
slope 7 and the cusp size rj as parameters. For a single major 
merger of a galaxy this is in very good agreement with the 
results of iGrahaml . Deriving Eq. (J55J we assumed efficient 
energy extraction via slingshot ejection (i.e. e = and k = 1) 
and therefore consider the ratios in Fig. [JJ as lower limits. 
Hence for mass ratios between 1 and 2 the cusp seems to be 
steeper than 7 = 2 with 7 > 2.5 being more likely and thus 
strengthening our arguments. 

5.4 Evolution time scale of an isolated cusp 

A first approximation to the relaxation time of a col- 
lisionless system of parti cles is £ ro i ax ~ t CIOSS 0.1N/ In N 
(Binn ev" fc Tremainell 19941 ). where t CIOSS is the crossing time 
and ./V the number of stars in the system. For our cluster 
with 10 8 stars, a size of about 1 pc and a typical velocity of 
vty P ~ y GNm* /ah we obtain a relaxation time of about 
10 9 yr. However, the derivation of the expression of the re- 
laxation time assumes a constant typical velocity for the 
cluster, which requires a density profile oc r~ 2 . On the other 
hand the surface density is assumed to be constant as well, 
which is obtained for p oc r _1 for a distribution extending 
to r = 0. For a cusp with a profile as steep as 7 = 3 the re- 
laxation time will be shorter in the inner than in the outer 
regions and will also be affected by the boundary conditions. 

A more precise expression for £ ro iax, which also takes 
into account encounters between stars, can be derived with 
the help of the diffusion coefficients in the Fokker-Planck 
approximation. We call thi s the diffusion time and following 
iBinnev fc Tremaind (|l994f ) we obtain 
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Figure 7. Ejected mass required for a successful merger is dis- 
played in units of the mass of the binary as function of the mass 
ratio q = 7712/ mi. The bold lines are for r] = 100, and the thin 
lines for 77 = 20. 



with A = 6 max Wty P /2Gm» and 6 ma x being the maximum 
impact parameter, which is of the order of the size of the 
cusp. Because only the logarithm of A enters Eq. (|56p . even 
larger errors of A result only in minor errors of the diffu- 
sion time. However, the relaxation time depends sensitively 
on the velocity, which is of the order of the circular veloc- 
ity v c — \J GM(r)/r. While for a steep cusp with 7 = 3 
the density in the denominator of Eq. (|56[1 will diminish 
the diffusion time in the inner regions, the circular velocity 
counteracts this tendency and increases with decreasing r 
(being a factor of more than 3 larger for 7 = 3 than for 
7 = 2 at r w 3a g ). Fig. [8] shows tdiff as a function of the 
radius for 7 = 2 and 3. As expected the evolution times 
are much smaller at small radii for both slopes of the den- 
sity distribution. Surprisingly the relaxation times of the 
steeper cusp (bold dashed curve) are larger than of the flat- 
ter cusp (bold dotted curve) over the whole plotted range, 
even at smaller radii. This is due to the strong dependency 
on the velocity. If we replot r-diff and keep the velocity fixed 
to vtyp = yGAfc/tth we obtain the thin curves, which are 
more consistent with our expectations: In the inner parts the 
steeper cusp evolves faster, but slower in the outer regions 
than a flatter distribution with 7 = 2. The difference be- 
tween both dotted curves, increasing towards the inner edge, 
is due to the influence of the cut-off at the inner boundary 
of the mass distribution on the circular velocity. However, 
a constant velocity all over the cusp region does not seem 
to be appropriate if the slope of the mass distribution is 
different from 7 = 2. 

To check this result we compare it with the time a star 
needs to spiral inwards from a distance r > e% to a g due to 
dynamical friction. The decelerating force F = m* dv/dt is 



tdiff 



4-7T G 2 m*p(r) In A ' 



(r) 



(56) 



vi{r) \ 



with erf being the error function and \ 



(57) 
c/{V2a) 
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Figure 8. Relaxation time of the cusp as function of the radius, 
derived using diffusion coefficients. Bold lines: -uj? oc M(r)/r, thin 
lines: v c = const. 



Figure 9. The median relaxation time as a function of the cusp's 
slope. The bold curves have been derived using the diffusion coef- 
ficients while for the thin curves the dynamical friction formalism 
has been used, see Eq. I I62I I. 



ijBinnev fc Tremaineh . This force is tangential to the orbit 
and therefore causes the star to lose angular momentum per 
unit mass at a rate 

dh _ Fr 
dt m* ' 



(58) 



The star continues to orbit at the speed v c (r) as it spirals 
into the center with an orbital angular momentum per unit 
mass h = rv c . If 7 = 2 and the cusp extends to the origin 
the circular velocity is constant. The integration of Eq. (|58[) 
yields for the time to spiral inwards from an initial radius 
ri n to a g 



tfric — 



?7 



87rG 2 m,S(x)lnA jp 



(59) 



Here S(x) is just an abbreviation for the term in the brackets 
of Eq. (|57[) . and x is the radius in units of ah. If the distri- 
bution has an inner boundary at a g the circular velocity is 
not constant anymore what has to be taken into account in 
the time derivative of 1, in Eq. (|58fl . Then the time to spiral 
to the inner boundary becomes 



tfric — 



+ ln^T/Zin + sj rjx in - I) J 



^(2r/a5 in - 3)\/x in (x in - 1/77) 



(60) 



with Q = x /7Va h /(Gm»)/(4S'(x)lnA) es4x 10 9 yr for our 
cusp and M c = Nm,. Also for 7 = 3 the circular velocity 
depends on r. Integrating and solving for the time yields 



2a h 



_ ^u, h 1 N ln(x in T?) , . 

Mc_ 127rS( X )ln A VGm. ln V XSn ' [b > 

Plotting tf r i c in the same way as t,aB in Fig. [8] we obtain very 
similar curves whith only slightly smaller times. Thus the re- 
laxation time due to dynamical friction is in very good agree- 
ment with the time obtained using diffusion coefficients, con- 
firming a larger relaxation time for steeper profiles. For both 



results the relaxation time quickly drops to zero as the in- 
ner boundary of the distribution is approached. This is just 
a consequence of our assumption that the mass distribution 
does not extend to smaller radii than a g . However, real cusps 
will not have such a sharp cut-off and their profiles will ex- 
tend into the inner regions, although with a slope less steep 
than 7 = 3 (the mass has to be finite), resulting in times 
larger than those shown in Fig. [5] 

To avoid these problems we try to characterize 
the cusp by a sin g le re laxation time, as suggested by 
iBinnev fc Tremaind (|l994l ). and replace the density by the 
mean density pu = \M C / \\itr'^) inside the cusp's half- 
mass radius ru- The mean-square speed of the stars is 
best described by {v 2 } ~ 0AGM c /rh, and we use A — 
rh(v 2 } /Gm* — OAN. For the diffusion and friction time we 
then obtain 
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tdlS.h =-. — 



2x3/2 



tfric.h — 



Aty G 2 m*ph In A 
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n 



(62) 



87rG 2 m,S'(x)lnA 7 p 
where the half-mass radius in dependency of the slope is 

7 _ 3 S 1/(3-7) 

J , 7/3 

(63) 

7 = 3, 



r h 

x h = — = <, 



l + y< 
T 



1 



and po is taken from Eq. Q. Note that for 7 = 3 the half- 
mass radius is the geometrical mean of a g and a^, i.e. = 
y^OgTTh". The dependency of these times on the slopes with 
77 as parameter is shown in Fig. On average the diffusion 
coefficients (bold lines) yield slightly larger relaxation times 
than dynamical friction (thin lines). Both times decrease 
with increasing slopes, as might be expected, and increase 
with decreasing 77, i.e. increasing a g . While for small 7 the 
relaxation time does not sensitively depend on the slope it 
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quickly decreases at 7 us 3 before becoming flatter again at 
7 > 4. For both methods we find trciax to be in the range 
between 10 s and 10 9 yr for a slope in the range 2 < 7 < 3. In 
the following we assume that a cusp with 7 as 3 is conserved 
for less than 10 9 yr. 

The strict conservation of the slope 7 = 3 is only nec- 
essary in the theoretical derivation of Eq. ([39} . We assumed 
the stars to move on circular orbits and that the binary 
can only eject those stars whose radius corresponds to the 
current semimajor axis. For elliptical stellar orbits the dis- 
tribution could be flatter and extend to larger radii, but a 
more massive cusp would be required for coalescence due 
to a smaller binding energy, as explained previously. Once 
the binary has become hard it will influence all stars mov- 
ing at smaller radii than a. If the inner parts of the cluster 
collapse, maybe even into a third BH, its mass will become 
bound more deeply in the potential of the binary and hence 
its ejection will extract more energy. Only if these stars are 
so deeply bound in the potential of one of the BHs that 
the perturbations caused by the secondary are negligible the 
merger might stop. Therefore the BHs might still coalesce 
when the time since they have become hard exceeds a cou- 
ple of 10 8 yr and a p oc r~ 3 profile is not conserved. We may 
conclude that the binary covers the second phase when it 
decays from ah to a g in the time thg < 10 9 yr. 



6 TIME DEPENDENCY OF THE MERGER 

With our approach to compute the required stellar mass for 
a successfully merging BHB we can not determine the rate at 
which stars are ejected and consequently no shrinking rate 
of the binary. To derive the time dependency of the merger 
we would have to make further assumptions about the rate 
at which stars interact with and are ejected by the BHs, i.e. 
about the velocity distribution of the stars. This might be 
quite difficult during an ongoing merger. Nevertheless, one 
assumption we want to make is that the shrinking rate is 
constant once the binary has become hard, 



„ d (l\ 
H = — I — I = const, 
at \ a , 



(64) 



This behaviour has been observed for hard binaries 
and a full los s-cone in variou s numerical experiments 
llHillsl 1 19831: iQuinlanl Il996l : iMilosavlievic fc Merrittl 
l200ll ; IZier fc Biermanii l200lf ) and should therefore 



also be applicable for steep and compact cusps 
and to tri axia l gala x ies whose loss cone i s al- 
ways full jYul 120021; iHollev-Bockelmann et all l2002i: 



Hollev-Bockelmann fc Sigurdssonl 20061 ; Berczik et al.l 
20061). Note that our definition is different from that used 



by Quinlanl (|l996l ) who defined the shrinking or hardening 
rate as a dimensionless quantity. We assume that the binary 
needs the time th g to shrink from the initial semimajor 
axis ah to a g . Integrating Eq. (I64|l in time from t = to 
some time t < th g and in distance from ah at t = to a(t), 
respectively, we can solve for the semimajor axis which the 
binary has reached after the time t has elapsed and obtain 

a(t) _ 1 
ah 1 + Ha^t 
If t — thg the binary has shrunk to a g and solving for the 
hardening rate yields H — (77 — l)/ahfh g . Applying this to 
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Figure 10. Evolution of the semimajor axis (decreasing curves) 
and the ejected mass (increasing curves) with time. The shrinking 
rate is assumed to be constant. 



(66) 



Eq. (|65[1 we can rewrite the semimajor axis in the form 

a(t) _ 1 
a h 1 + (77 - l)t/t hg ' 

The integration of Eq. <(Sj gives the mass which has been 
ejected by the time the semimajor axis has shrunk to a(t) 
and we only need to replace a g with a(t) in Eq. © to obtain 
an expression for m(t). With the help of Eq. (|66[1 we can 
write the time dependency of the ejected mass as 

m(t) _ In [1 + (rj - l)t/t hs ] 



In 77 



(67) 



Both functions are plotted in Fig. \W\ for different ratios 77. 
The increasing curves show the mass evolution and the de- 
creasing ones show the change in the semimajor axis with 
time. In the beginning the evolution of the binary is fastest, 
especially the decay of the semimajor axis. It has decreased 
to 1/5 or less of its initial value after only ~ 1/5 of the merg- 
ing time thg- In the same period the ejected mass amounts 
to more than a half of the total mass m G j which the BHs 
have ejected once they reach the separation a g . After a time 
of about 0.2 th g has elapsed the evolution slows down noti- 
cably and the binary spends most of its time in the second 
phase to shrink the remaining distance from less than 0.2 
to a g = a^/rj. Hence, if a binary really stalls it will be most 
likely found in the range a g < a < 0.2 ah, see Section [8] 
This is in agreement with th e resul ts of the numerical three- 
body experiments by iRoosI |l981) which suggest that the 
binary stops shrinking at a separation of the BHs of about 
0.015 r cusp . 

Initially the mass ejection rate is rh(0) = p(rj— l)/(fcth g ) 
and exceeds the rate at the end of the second phase when t = 
th g by a factor of rj. This is probably the time when the BHB 
carves a tor us comprised of stars out of t he initial stellar 
distribution (|Zier fc Biermannl l200ll . |2002| ). In Section 15.41 
we showed that the steep profile of the cusp is conserved for 
some 10 s yr, and therefore the binary should shrink from 
to a g on a similar time scale or less. Taking th g = 5 x 10 s yr 
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and assuming a mass ratio 5 = 1 with mi « 10 s Mq we 
obtain an initial ejection rate of about 2, 5 and WMq/yt 
for rj — 20, 50 and 100, respectively. At the end of phase 2 
the ejection rate is smaller by a factor of I/77 and amounts 
only to 0.1 M@/yr, being almost independet of rj for rj 3> 1. 
This again indicates that the evolution at the end of phase 
2 is much slower than in the beginning. 

We have not taken into account any dependency on the 
density profile and velocity distribution. While the shrinking 
rate might still be a constant due to a filled loss-cone, the 
shrinking time th s probably depends on both distributions. 
This could be explored in more detail with the help of TV- 
body simulations. From our estimates in Section f5. 41 of the 
relaxation time of the cusp we deduced a time thg less than 
10 9 yr. Another estimate for the order of magnitude of thg 
could also be derived from Z-sha ped radio galaxies, as has 
been pointed out by IZierl |2005), but that is beyond the 
scope of the present paper. 



7 m cj IN MULTIPLE MERGERS 

Although our approach to the merger of a massive BHB 
is quite simple the results seem to describe such a merger 
reasonably well. However, there is a simple and easy consis- 
tency check with numerical simulations which we want to 
carry out. According to simulations more mass is ejected if 
the primary BH merges N times with a BH of mass mi/N 
than in one merger with a secondary BH o f mass mi — m\ 
(e.g. lQuinlarj|l996l ; IZier fc Biermann||200ll ). In case of one 
merger we have m c j = mi ln(ah/ig)/(2fc), cf. Eq. @ with 
q = 1. If we distribute the merging mass over N mergers 
we have m% = mi/N. When the ith merger proceeds, the 
primary's mass is that it has after the (i — l)th merger has 
been completed, 



mi 



mi + 



-mi 



mi- 



N + i 



N N 
The mass ratio during the ith merger then is 
m,2 1 



<H = 



mi,i 



N + i-V 



(68) 



(69) 



and hence the reduced mass fu = mi,i <?i/(l + <!%)■ Thus we 
can write the mass which becomes ejected during the ith 
merger as 

1 N + 



rn 1 



In 



N N + i 



(70) 



The total mass ejected in N mergers then amounts to 

^ mi / a h \ 1 ^ N + 1 - 1 

m eJ ,tot = 2^m ej>i = — In — KT , , ■ ( 71 ) 



N + i 



The sum on the right hand side is equal to N + tp(l + N) — 
+ 2N), where tjj is the Psi (Digamma) Function and can 
be approximated by N 2 /(l + N), with a maximum error 
of about 6% for N — 2. This approximation corresponds 
to keeping i fixed to 1 in the above expression for mi,j so 
that mi,i remains constant at mi,, = mi for all mergers. 
Hence the primary's mass is fixed and its growth with the 
increasing number of mergers can be neglected. The total 
ejected mass after N mergers is approximately 

mi N a h 
m cj , tot ^ — j-r^ln- (72) 



and the ratio of the mass ejected in N mergers with 7712 
mi/N compared to one merger with 777,2 = mi is 



m c j tot 



= 2- 



JV 



l + N 



(73) 



This is a function that grows with N, in agreement with 
the results of numerical simulations. In deriving this ratio 
we neglected the dependency of 77 = ah/a g on the mass 
ratio q. According to numerical experiments this ratio is 
increasing with decreasing q. Therefore, the inclusion of this 
dependency would result in an ejected mass which increases 
more steeply with N, making our result more pronounced. 



8 ONGOING MERGERS 



The results we obtained in the present article and IPaper J 
suggest that the BHB which forms after the collision of two 
galaxies most likely merges. In the introduction we cited ob- 
servational evidence for ongoing mergers where the BHs are 
still orbiting around each ot her. It has been pointed out by 
iGopal-Krishna. et al.1 |2003) that in ZRGs during the time 
between the bending of the pre-merger jet into a Z-shape by 
the secondary galaxy and the launching of the post-merger 
jet after the coalescence of both BHs we should see only the 
pure Z-shape, but no complete X-shape of the jets. So far no 
galaxies with a pure Z-shape have been observed. In the new 
and st rongly increased sample of XRGs, compiled bv lCheungl 
(2007), about three such sources out of hundred galaxies 
might have been observed for the first time, i.e. J0145-0159, 
J1040+5056 and J1206+3812. Of course this needs a thor- 
ough and detailed analysis. However, even the existence of 
these objects would not indicate that the binary has stalled 
and is rather a sign that the separat ion of the BHs is some- 
where below 30-100 kpc (|Zieij|2005l ). therefore providing a 
very important laboratory for the rese arch of o ngoing merg- 
ers and merger history. The sample of I Cheunsl also seems to 
increase the number of post-merger ZRGs, supplying more 
objects for the deprojection of the jets as has been done by 
Zicr (2005). The pure Z-shaped sources might be good candi- 
dates to look for a spatially resol ved binary like i n NGC 6240 
which has been discovered by iKomossa et al.1 (|2003h . Be- 
cause of the projected separation of the BHs of about 1.4 kpc 
this merger is current ly in the first phase. More recently 
iRodriguez et al.1 (|2006h have discovered a black hole binary 
with a total mass of ~ 1.5 10 s Mq and a projected separation 
of only 7.3 pc in the radio galaxy 0402+379, probably the 
most tightly bound binary that has been observed directly 
so far. The elliptical host galaxy shows signs of a recent 
merger and the two nuclei appear to be active, suggesting 
ongoing accretion and dissipation. Because the separation is 
seen in projection it is rather a lower limit and therefore the 
merger is still in the first phase or at most at the beginning 
of the second when the loss-cone is still full. Hence, this is 
another example of an ongoing merger where the loss-cone 
is not depleted. However, based on numerical simulations of 
a BHB in a stellar core some authors claimed that probably 
the binar y stalls in the second phase due to loss-cone deple- 
tion ( e.g. Quinla n 1996; Makino 1997; Quinlan fc Hernquistl 
ll997tlMilosavlievic fc Merrittll200ll ; iBerczik et al.ll2005l ).~To 
test this prediction, which contradicts ours of successfully 
coalescing BHs, we look for ongoing mergers which happen 
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Table 1. Sources exhibiting (semi)periodic changes in lightcurves, possibly due to a BHB. Columns: (1) source, (2) redshift, (3) intrinsic 
period, (4)-(7) BH masses, (8) current separation scaled to a s in units of 10 -3 (q = 0.1), (9) mass ratio obtained under the condition 
that a = a g , and (10) remaining time to coalesce due to emission of gravitational waves (q = 0.1). 



Source 
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log( A4rh 


/Mm) 
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a = cig 






[yr] 










[lO- 3 ] 


- log(<?) 


lo s(*g/y r ) 


(i) 


(2) 


(3) 


(4) 


(5) 


(6) 


(7) 


(8) 


(9) 


(10) 


Mrk 421 


0.031 


22.4 






7.6 


8.3 CT 


373-730 


2.7-1.6 


8.3-9.5 


Pks 0735+178 


0.424 


10.0 


8.1 


8.3 


8.2 




218-264 


3.7-3.3 


7.4-7.7 


BL Lac 


0.069 


13.1 


6.4 


7.3 


7.7 


8.4 CT 


237-1615 


3.5-0.01 


7.5-10.8 


On 231 


0.102 


12.3 






8.0 




334 


2.9 


8.1 


Oj 287 


0.306 


9.1 


7.7 


8.4 


8.1 


8.8 A 


127-364 


4.6-2.8 


6.4-8.3 


Pks 1510-089 


0.361 


0.7 






8.0 


8.6 A 


27.8-49.4 


7.2-6.2 


3.8-4.8 


3C 345 


0.595 


6.3 






8.0 


9.3 A 


61.4-214 


5.9-3.7 


5.2-7.3 


AO 0235+16 


0.940 


2.9 


8.7 


8.7 


8.0 




65.1-127 


5.8-4.6 


5.3-6.4 


3C 66A 


0.444 


0.125 






8.0 




15.7 


8.2 


2.8 


Mrk 501 


0.033 


0.063 






8.3 


9.2 CT 


3.14-7.43 


11-9.5 


-0.01 to 1.5 


3C 273 


0.158 


0.00225 






9.0 


9.2 L 


0.34-0.41 


15 


-3.9to - 3.5 


Sgr A* 


0.0 


0.3 




6.5< e ) 


6.6</> 




108-118 


4.9-4.7 


6.1-6.3 



Referen ces for the periods: M rk 421 jLiu et alll997ft. Pks 073 5+178 l lFari et al .1119971), BL L ac jFan et al.lll998l). On 231 <Liu et al.ll995h , 
Oi 287 llPursimo et al.ll2000l) . Pks 1510-089 [xie et aLll2002t) , 3C 345 llZhang : et alj|l1)9sh, AO 0235+16 dRaiteri et al.ll200ll) . 3C 66A 
llLainela et al.lll999l) . Mrk 501 llHavashida et al.lll998F> 7 3C 273 jXie et alj|l999l) , Sgr A* jZhao et alj|200ll) . Reference s for the masses : 
With the exception of Sgr A* the values i n columns (4) and (5) were take n fromlXie et al l i2002l) , in co lumn (6) fromlXie et al.l (1200411 
and in column (7) from lWang et al.l d2004h . Masses for Sgr A* are from ( e ) ISchodel et al.l ll2003h and M iGhez et al] J2003h . The indices 
a, A, L indicate the method used to determine the BH mass. See text for details. 

Note that iPeterd (| 19641 ) actually gave this expression in the 
form (da/dt) = -(64/5) G 3 iiMl 2 / (c 5 a A ) which integrated 
yields Eq. (|75[1 . Some authors alternatively used the defini- 
tion i g = |d/a| _1 evaluated at a — a g which differs form t e 
in Eq. (f75j) by a factor of 4. Combining Eqs. ((711 and CTJ9 
we obtain for the current semimajor axis in units of a s : 

a_ _ J_ / mi \~ A / tg \~* (L\^ (j±g)j? 
a g ~ 29 {l0*M Q J \W 10 yr) [yr ) g 1 / 4 ' 

(76) 

Because of the small exponent 1/4 the semimajor axis de- 
pends only weakly on the time. Scaling t s with 10 8 yr instead 
of 10 10 yr increases a by only a factor of ~ 3. The period T 
and the mass mi of the primary BH of the binary we derive 
from observations. Assuming a mass ratio q then allows us to 
compute the separation of the BHs. In Table [T] we listed the 
sources in column (1) and used their redshifts (2) to trans- 
form the observed periods into the rest frame of the source 
via Tintr = T obs /(l + z), column (3). For the central BH 
mass we found different values in the literature which have 
been obtained w ith various methods (columns (4) to (7)). 
IXie et all (|2002h assume a maximally rotating Kerr BH and 
relating the observed minimal timescales of the luminosity 
variations, on scales between 1/2 to 12 hours, to the period 
of the marginally bound orbit they obtain an upper limit of 
the BH mass which is given in column (5). The same authors 
used an expression for the Eddington-limit that includes 
the Klein-Nishina effects on the Compton scattering cross 
Section to obtain a lower limit for the mass, listed in co l- 
umn (4), a method pr oposed by iDermer fc Gehrelsl (|l995h . 
Later IXie et all (|2004l ) used again the method of the min- 
imal timescale s for a larger samp le with the results shown 
in column (6). IWang et al] l|2004h employed three different 
methods to determine the central BH mass which is indi- 
cated by the upper index in column (7). They either used 



to be in the second phase. These might still not be stalled 
and actually en route to coalescence. Only if there is a sig- 
nificant number of ongoing mergers in the second phase, 
preferentially in the range a g < a < 0.2 ah as we predicted 
in Section [6] this could indeed argue for a stalled binary. 
A still existing BHB ma nifests itse l f also in se mi-periodic 
signals in light curve s . see [Komossal (|2003l . 120061 ) and refer- 
ences therein. iKat j l| 19971 ) presented a model for OJ 287 
where the precession of the accretion disc, driven by the 
gravitational torque of the secondary BH, causes the jet to 
sweep periodically across our line of sight. Doppler-boosting 
leads to the observed variations of the luminosity and the 
period of the binary is much less than the 9yr interval of 
the lumin osity variations i n the rest frame of the g alaxy . 
Models bv lSillanpaa et al] (|l988l ) and lValtaoia et al] (|2000l ) 
relate the variations of the lightcurve to interactions of the 
secondary BH with the accretion disc and therefore the ob- 
served period corresponds to th e orbital period of the binary. 
iMerritt fc Milosavlicvic (2005) have compiled a sample of 
active galaxies whose observed periodic variabilities might 
be related to the orbital motion of the BHs. We assume that 
the intervals of these variabilities correspond to the period 
of the binary and use them together with independent esti- 
mates for the mass of the BHs to determine the separation of 
the BHs and hence the phase in which the merger has been 
observed. Kepler's third law relates the binary's period T to 
its semimajor axis, 



We scale a with the semimajor axis where gravitational 
waves start to domin ate the deca y of the binary, a g . For 
circular orbits this is (|Peterslll964l ) 



256 gVM^ 1 1/4 
5 C 5 h 



(75) 
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the Mbh — cr relation as fitted bv lTremaine et all d2002h (a). 
thelinewidth- luminosity-mass scaling relation ( Kaspi et al.l 
2000) (A), or the correlati on between the luminosi ty of the 
host g alaxy and BH mass (IMcLure fe Dunlop 200l|) (L). For 
Sgr A* iGhez et al l (|2003l ) and lSchodel et al l (20031 ) used ob- 
servations of absorption lines to determine the orbits of cen- 
tral stars and hence the mass of the BH. 

As can be seen in Table [1] the masses for some objects 
are quite different by up to a factor of 100 (BL Lac, 3C 345, 
Oj 287). There is also a contradiction for AO 0235+16 be- 
tween the lower and upper limits of the mass in columns 
(4) and (6), respectively, and between coulmns (6) and (7), 
where the latter mass is not supposed to be an upper limit. 
Notwithstanding these problems we just took the smallest 
and largest mass and computed the corresponding range 
of the current separation of the BHs using Eq. (|76|l with 
q — 0.1. The result is given in column (8) in units of 10 -3 . 
For q — 1 the obtained semimajor axes would be even 
smaller by a factor of 0.6. Assuming that the binary's cur- 
rent semimajor axis is just at the transition from phase 2 to 
3, i.e. a = a g we can solve Eq. (|76|l for the required mass 
ratio q and obtain 



1 = 



3 i/3 2x + 2 1 / 3 (V3 X 2 (27-4 X ) + 9 X ) 2/3 



6 2 /3(V3x 2 (27-4 x ) + 9x) 1 / 3 
Here we used the definition 



X 



1/12 



(4tt 



-l/3 c 5/4 



(GmiY 



5/12^2/3^-1/4 



(77) 



(78) 



The negative logarithm of the mass ratio is tabulated in col- 
umn (9) . Binaries with a smaller mass ratio are still in phase 
2, while for larger ratios gravitational radiation already dom- 
inates the decay. In the last column we listed the remaining 
time for the BHs to merge due to emission of gravitational 
waves if q — 0.1, which is obtained by solving Eq. (|76[) f° r 
t s with a = a e , 



1.5 10' 



mi 



(l0 s W. 



-5/3 



8/3 



(1 + 9) 



1/3 



■yr- 



(79) 



For q — 1 this time is shorter by a factor of about 0.12. 

The results show very clearly that all binaries with- 
out exception are already in the third phase of the merging 
process. This can also be seen in Fig. [TT] where we plotted 
column (8) vs. (10). Only if we use the lower limit for the 
mass of BL Lac the separation of the BHs is larger by a fac- 
tor of 1.6 than a g and consequently the remaining merging 
time due to emission of gravitational waves exceeds a Hubble 
time (columns (8) and (10)). Taking the mass from column 
(7), which is obtained from the Mbh —o relation and there- 
fore might be more reliable, also this source is well beyond 
the limit to the third phase. For the intermediate masses, 
log(M BH /M ) = 7.7 and 7.3, we obtain a/a s = 0.46 and 
0.68, respectively. As said before a larger mass ratio would 
further diminish the current separation of the BHs. Thus, 
even for the lower limit of the mass of BL Lac an equal mass 
binary would be in the third phase. The smaller the mass 
ratio is, the larger is the current separation and the longer 
it takes for the BHs to merge. From column (9), showing 
the negative logarithm of the mass ratio for which the sep- 
aration is a — a s and the remaining merging time is the 
Hubble time, we see that even for very small mass ratios 
the binary just enters the third phase. All upper limits are 



- 10 




Figure 11. Plot of column (8) vs. (10) of Table Q] for q = 0.1. 
Data lie on the curve defined by Eq. H76H with the errorbars cor- 
responding to the mass ranges in column (4) to (7). Horizontal 
and vertical lines indicate transition between phases 2 and 3. 



smaller than 0.1. Of course BL Lac again is the sole excep- 
tion, but only for the lower mass limit. All binaries, with the 
exception of BL Lac for the small mass limit, will coalesce 
in much less than a Hubble time (column (10)). Using q = 1 
instead of 0.1 the merging times are smaller by a factor of 
0.12 so that the binary in BL Lac will merge in less than a 
Hubble time also for the small mass limit. In case of eccen- 
tric orbits of the BHs the merging times would be further 
decreased. Thus the times we obtained are actually upper 
limits. E xplaining the variations in the lightcurves with the 
model bv lKata (119971 ) would result in much smaller periods 
of the binaries and hence in smaller separations and merg- 
ing times (Eqs. (|76l I79|l ). Consequently the binary would 
be even deeper in the third and last phase of the merger. 
Therefore, provided that the variations in the lightcurves 
are due to a BHB in the center of the galaxies, our findings 
in this Section are in very good agreement with the results 
of the previous Sections and strongly support our conclu- 
sions in lPaperll : Most likely the slingshot mechanism in the 
second phase of the merger is sufficiently efficient in order 
to extract enough energy and angular momentum from the 
binary so that the BHs can enter the final phase. Hence the 
profile of the cusp at the beginning of the second phase is 
very steep indeed, and the binary covers this phase in less 



than ~ 10 yr (Section 15. 2[) . Once gravitational radiation 
dominates the shrinking in the final phase the BHs merge 
in less than a Hubble time. It is actually striking that all 
possible non-merged BHBs are observed in all phases but 
the second, which according to loss-cone depletion models is 
the one in which binaries should most likely be found. 



9 DISCUSSION AND CONCLUSIONS 

Observational evidence suggests that BHBs, formed in a 
galaxy collision, eventually coalesce within less than a Hub- 
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ble time . Focusing on stars bound to the binary we showed in 
IPaper J that slingshot ejection of stars in the second phase 
of a merger, which is considered to be the bottleneck, is 
sufficiently efficient to allow the BHs to coalesce. The pre- 
requisite is a steep cusp which is about as massive as the 
binary when the binary becomes hard. In this paper, we 
further pursued this idea and compared its predictions with 
observations and numerical simulations. Our results verify 
and strengthen our conclusion of a steep cusp and that the 
BHs coalesce in less than a Hubble time. 

In Sections [3] and [4] we e xamined in detail the assump- 
tions on which our results in IPaper II are based: The kick- 
parameter is about 1 and we can neglect the cluster poten- 
tial when calculating the energy which the binary loses to 
the ejected stars. Our crude theoretical estimate for k is in 
agreement with the kick-parameter we derived from the data 
obtained from simulating a stellar cluster in the potential 
of a binary moving on fixed orbits i|Zierll2000l ). Comparing 
these values with those obtained in other simulations and 
scatteri ng exp eriments (|Hills fc Fullertonlll980l ; IRoosII1981| ; 
lQuinlanlll996l ; lYull2002h we found very good agreement, jus- 
tifying our assumption of fc « 1. While including the cluster 
potential tends to slightly increase the kick-parameter on 
average, we found that it can be neglected in comparision 
with the potential of the binary when computing the energy 
of the stars. The influence of the cusp on the total potential 
becomes even less for steeper profiles which are required for 
a successful merger. Hence the assumptions are well justified 
and we can be confident in our results. 

In Section [SI we derived a density profile which enables 
the BHs to merge and is in agreement with the observed 
post-merger profiles. Using a deprojected Sersic model with- 
out a cusp for non-core galaxies results in a large amount of 
total mass for a successful merger if w e use a shape p aram - 
eter n ~ 5, which has been found by iTruiillo et alj (|2004T ) 
for the best fits (cf. model (a) in Fig. 3]). If, however, a steep 
central cusp is formed which is removed subsequently by the 
binary in the second phase, like in model (b), the total mass 
is much less. To obtain the initial profile, we add such a 
cusp to a post-merger profile of a core gala xy, best approx - 
imated by a core-Sersic model with n ~ 5 ( Truiillo et alt ). 
For 7 > 2.5 this pre-merger profile allows the BHs to merge 
without the total mass becoming too large (~ 1O 12 M0), 
with a core galaxy as merger remnant, see Fig. [6] Without 
this cusp, i.e. for a flat core with 7 < 2, either a very large 
amount of mass is needed, or the binary stalls before it is 
able to enter the third phase. This is in good agreement 
with the r esults f rom n umerical three-body experiments ob- 
tained by IRoosI (|l98ll ). Hence we argue that a large and 
shallow stellar cluster is the end product of a merger while 
at the time the binary becomes hard a steep cusp is formed 
which allows the BHs to coalesce. The mass distribution of 
a core galaxy with a cusp is shown in Fig. [5] by the thin 
dash-dotted line. It follows a flat power law between ah and 
the break radius r& fa 10 a^. If core galaxies are merger rem- 
nants and the core is formed by mass ejection of the binary 
we would expect to coincide with rv This discrepancy 
might be caused by the assumption of a too small a^: Stars 
bound very tightly to their host BH might increase its ef- 
fective mass and hence the second phase might start earlier 
at larger distances. Furthermore we only considered circular 
stellar orbits. Stars moving on eccentric orbits with apocen- 



ters in the range ah < r+ < ri and pericenters r_ < ah also 
interact with the binary. The ejection of this additional mass 
will shift ah to larger radii, up to n, for e « (rj— at) / (j7+ah). 
Such stars do not extract as much energy as the more tightly 
bound stars moving on circular orbits and more mass has to 
be ejected. However, these stars could compensate for this 
mass and enable successful mergers in shallower cusps. 

Depending on how closely the profile of the ejected mass 
approaches the initial distribution all slopes for the final 
profile which are less than that of the cusp are possible. 
This even includes distributions where the density drops 
wi th decreasing radiu s, which have actually been observed 
bv lLauer et al.1 ll2002f). They might indicate that the binary 
got stalled (|Zier fc Biermann|l200ll ) , but can also be formed 
by BHs which successfully merge. The maximum of these 
distributions is observed to be at radii about a factor of 
10 larger than ah, i.e. close to 77 of core galaxies, and thus 
supports the arguments above for a larger a^. 

At the end of Section [5] we computed the ralaxation 
time of the cusp in dependency of its slope and obtained a 
couple of 10 s yr if 7 = 3. For longer times the steep profile is 
not conserved and if the binary could only eject those stars 
whose orbital radius is the same as the current semimajor 
axis, as we assumed to derive Eq. ((39}, the merger would 
stall. Of course the BHs, once they have become hard, will 
influence all stars at r < a, and only if they are very deeply 
bound in the potential of one of the BHs they will not be 
ejected. Hence deviations from p tx r~ 3 will not immediately 
cause the binary to stall. Deriving Eq. (|39[1 we assumed the 
stars to move on circular orbits. As we pointed out above 
the inclusion of eccentric orbits allows for shallower ejected 
profiles (7 < 3) which might extend to r;, instead of a^. 
The shallower cusps would have a larger relaxation time, see 
Fig. [5] Thus we expect its lifetime between 10 8 and 10 9 yr 
f° r 3 < 7 < 2.5 and the BHs to decay from ah to a g on the 
time scale thg which is similar or less. 

All observational evidence for successfully merged BHs, 
cited in the introduction, also supports our prediction of 
the transient formation of a steep cusp. But how is such a 
cusp formed? Adding adiabatically growing BHs to nonro- 
tating spherical galaxy models and seeking for equilibrium 
solutions for the cusp, lOuinlan et al.l l|l995h find slopes as 
steep as 2.5 with an initial 7 = 2 and argue that steeper 
cusps without central BHs are unlikely. However, the steep 
cusp which is required for the merger is a transient feature 
and does not need to be in a sta t e of e quilibrium. Indeed, 
Fig. 3 in iMilosavlievic fc Merrittl | 200 ll ) suggests that im- 
mediately after the merger of both cores when the binary 
becomes hard the density profile inside ah is substantially 
steeper than 7 = 2. Whether this is a real physical effect or 
due to spurious numerical relaxation still needs to be clar- 
ified. When both galaxies merge energy will be dissipated 
and fractions of angular momentum cancel each other. The 
amount of these fractions will depend on the initial mass 
and velocity distributions in the isolated galaxies and on 
the magnitude and orientation of both galactic spins and 
the orbital angular momentum relative to each other. The 
mass that is funneld into the common center of the galaxies 
will merge with the cores surrounding each BH. At this time 
the central potential is strongly nonspherical so that both 
cores will be heavily perturbed, being far from a state of 
equilibrium, with profiles that might be steeper than 7 = 2. 
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At a — ah we expect them to merge into the required steep 
cusp with a mass of ~ M12. At this stage of the merger we 
can only speculate about the central mass distribution which 
might be traced with the help of detailed numerical simula- 
tions taking into account the above mentioned initial condi- 
tions. Th ey will also influence the morphology of the merger 
product |Toomre fc Toomrdll972T ) as well as processes like 
the star formation rate and hence the final gas and star con- 
tent. After the coalescence of the BHs the velocity of the 
surrounding stars will be tangentially anisotropic at r < ah 
and radially anisotropic at larger radii, with the ejected stars 
being focused to the equatorial pla ne of the binary, the mor e 
the larger their kinetic energy is (|Zier fc BiermannllioOll ). 
However, if the central regions of the cusp undergo a core 
collapse before the BHs could merge, possibly leading to 
the formation of a third BH, this mass is still available for 
slingshot ejection by the BHs which therefore still can enter 
the final phase. Our derivation of the initial profile might 
be oversimplified. A fraction of the ejected stars will stay 
bound to the cluster at larger distances while the stars re- 
maining in the cusp region are less tightly bound due to the 
smaller mass in the center and hence will also expand to 
larger regions. This is enhanced by the energy transfer from 
the binary to these stars, which are still bound by the BHs, 
i.e. the heating of this population. Hence, there is a shift of 
mass within the cluster from the inner to the outer regions, 
resulting in a profile which is flatter than the difference be- 
tween the initial and ejected mass distributi on. However, the 
mass deficiency which has been derived by iGrahaml (I2004T ) 
from the difference between the core-Sersic fit and the ex- 
trapolation of the pure Sersic profile into the central region 
amounts to 1-2 A/12 • This is in very good agreement with 
our results, provided that 7 > 2.5, see Fig. [7] 

Although our approach to the merging of the BHs 
does not allow to determine the time dependency, we made 
use of the finding in numerical experiments that the hard- 
ening rate i s constant onc e the binary has become hard 
jHillsl Il983l; iQuinlanl Il99rj ; IMilosavlievic fc Merrittl l200ll ; 
IZier fc Biermannll200ll ). see Section|5] Utilizing this assump- 
tion we obtained the semimajor axis and the ejected mass 
as functions of time with the parameter th s , the time to da- 
cay from ah to a g . This is less than 10 9 yr as we concluded 
from the relaxation time of the cusp in Section 15.41 The 
exact value of this shrinking time will depend on the mass 
and velocity distribution of the stars and hence on the initial 
conditions of the merger. Our results showed that the binary 
elvolves fastest in the beginning of phase 2 and then contin- 
uously slows down. Therefore, we conclude that if stalled 
binaries exist at all, they will most likely be found with a 
semimajor axis in the range a g < a < 0.2 ah. This is in 
agreement with the re sults of numerical three-body exper- 
iments by IRqqsI (|l98ll ) who finds that a binary might stall 
at ok 0.015 r cusp . 

In Section [3 we performed a simple co nsistency check - 



In agreement with numeri c al simulations (lOuinla: 
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Zier fc Biermannll200ll ) we find that the ejected mass is in- 
creasing with the number N of mergers if we keep the total 
mass which merges with mi constant. This means that more 
mass is ejected in N mergers of mi with m,2 = mi/N than in 
one merger with m.2 = m\. The stronger dependency on N 
which has been found in the simulations might be due to our 
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assumption that we can neglect the dependency of the ratio 
ah/ag on the mass ratio q. Our results show that the growth 
of mi during the N mergers has a negligible influence on the 
total ejected mass. 

Ongoing merger s have been observed directly by 
iKomossa et ail (|2003h in N GC 6240 with a projec ted sep- 
aration of 1.4 kpc, and by iRodriguez et al.l l|2006h in the 
elliptical galaxy 0402+379 with a projected separation of 
both nuclei of only 7.3 pc. Thus both mergers are still in the 
first phase and are clearly far from being stalled at the end 
of the second phase in a distance of a few 0.01 pc. Other 
promising sources with st ill existing binaries could b e pure 
Z-shaped radio galaxies l|Gopal-Krishna. et al.l [2003) with 
a separation less than 30-100 kpc 1 Zierl 20051 ) . Such objects 
might have been observed for the first time in a n ew sample 
of about 100 XRG candidates, see lCheunj (120071 ). Possibly 
some of these s ources ex hibit broad-emission lines character- 
istic of quasars (|Cheund . priv . comm.) as have been observed 
only recently in some XRGs (|Wang et al. l l2003l ; lLandt" et al.l 
120061 ). This actually strengthens the conjecture that merg- 
ing BHBs are the for mation mechanism for XRGs and the 
central torus in AGN (|Zier fc Biermannll200ll . |2002| ). which 
is required by th e unification scheme for type 1 and 2 AGN 
I Antonuccill 19931 ) : The symmetry axis of the torus, which is 
surrounding the nucleus and the broad emission line region 
(BLR), is aligned with the post-merger jet. Because in XRGs 
both lobes are close to the plane of sky we consequently see 
the torus almost edge on with the BLR hidden in its cen- 
ter. This is in good agreement with so few XRGs exhibiting 
BLRs. The larger the angle between the plane of sky and 
the post-merger jet, i.e. the axis of the torus, the more from 
within the torus we can see, including the BLR, and the less 
reddened the core should appear. For such objects we pre- 
dict shorter post-merger jets in projection. According to the 
merging scenario XRGs are seen close to edge-on and hence 
they should be good candidates for sho wing a type 1 spec- 
trum in polarized light like NGC 1068 (|Antonucci fc Miller! 
Il985l) . 

Stalled binaries are expected at the end of the second 
phase when slingshot ejection of stars becomes inefficient ac- 
cording to some interpretations of numerical simulations. In 
Section [S] we tested th is prediction and made use of a com- 
pilation of 12 sources (|Merritt fc Milosavlievicll2005l ) which 
exhibit periodic variations in their lightcurves. These might 
be caused by the orbital motion of a BHB. From the ob- 
served periods and masses obtained with various methods 
we determined the current semimajor axis of the binary, as- 
suming a mass ratio q = 0.1. We find that all binaries have 
already shrunk deep into the third phase and that the re- 
maining time to coalescence in all sources is much less than 
a Hubble time. The remaining merging time increases with 
decreasing mass ratio and therefore we computed q for the 
case that it still needs a Hubble time to merge. We obtained 
values much smaller than q = 0.1 and hence even in case of 
minor mergers the binary is already in the third phase. Some 
of the mass estimates differed by large factors, in case of 
BL Lac by 100. The sm allest mass obta ined with the method 
of minimal time scales (|Xie et al.ll2002l ) gives the only source 
that is still in the second phase if q < 0.98. However, for a 
larger mass which has has been obt ained with the proba- 
bly more reliab le A/bh - g relation (|Gebhardt et al ] 120001 ; 
iTremaine et al.ll2002l ) also this binary is clearly beyond the 
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transition to the final phase. Thus, in stri king contrast to 



the p redictions of loss-cone de pletio n (e.g. Begelman et al 
198(1 : iQuinlanl Il996l: iMakincj Il997|: iQuinlan fe Herncmist 
19971 ; iMilosavlievic fc MerrittlboOll ; berczik et al-lbOOSl ) ~w, 
find all BHB candidates to be already in the phase where the 
emission of gravitational waves dominates the decay. There- 
fore a merger with a third galaxy before the BHs have coa- 
lesced and the formation of three bound supermassive BHs 
with the subsequen t slingshot ejection of one or more BHs 
fe.g. IValtonenlll996l ) is highly unlikely. 

We conclude that the BHB which forms in a galaxy 
collision merges completely. This is in agreement with the 
observation of mostly merged binaries and only few ongo- 
ing mergers with none of them being stalled in the sec- 
ond phase. Hence the slingshot ejection of stars is suffi- 
ciently effective arguing for the formation of a steep cusp 
at the time when the binary becomes hard an d which 
contains a mass of ~ Mi 2, as we derived it in IPaper J 
and the present ar ticle. Triaxial potentials where the loss- 
cone is always full (IYuI|2002|; iHolley-Bockelmann et al.ll2002l ; 



iHollev-Bockelmann fc Sigurdssorj|2006l ; iBerczik et a7]|2006l ) 
further support our arguments for a successfully merged bi- 
nary. The inclusion of dark matter into our analysis, which 
we did not consider here although the same formalism ap- 
plies, would accelerate the merger and make it even more 
likely that the BHs coalesce. If there is not enough bary- 
onic matter in the cusp to allow the BHs to merge but they 
have coalesced anyway, our approach should provide a tool 
to draw conclusions about the amount and distribution of 
dark matter in the cusp region. However, we may conclude 
that stalled binaries do not exist at all or are very rare. 
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